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Preface 



This book is intended to be a realistic first course in quantum mechanics 
and as such no claim is made that the subject is covered comprehensively. 
However, it is hoped that the material included will not only give a good 
basic grounding in the subject but will introduce the student to the more 
advanced techniques essential for a serious study. 

After a brief survey of classical mechanics and of the historical back- 
ground of Quantum Theory, a postulate statement is given for Quantum 
Mechanics. One chapter is devoted to matrix representation and two 
chapters to approximate methods of solution. Group theory is introduced 
in chapter nine. This is a topic of growing importance. The final chapter 
is an introduction to Dirac's relativistic theory. 

This book should be useful to first degree students in physics, 
chemistry and mathematics and also to research workers in industry, 
polytechnics and universities. 

Problems are included at the end of each chapter and a few of these 
have been used to extend the theory slightly. SI units are used and the 
charge on the electron is taken as -e = - 1 -602 x 10"" 19 C. A list of 
references and further reading is given at the end of each chapter. 

I would like to thank my first teacher of the subject Professor H. C. 
Bolton whose influence I gratefully acknowledge. 

Tynemouth, H.C. 

June 1973. 



CHAPTER 1 



Classical Dynamics 



1.1 Introduction 

Although classical mechanics does not apply to the dynamics of atomic 
systems it certainly affords an excellent approximation to a wide range 
of phenomena. Quantum mechanics is a more general theory which con- 
tains classical mechanics as a limiting case and in fact historically quan- 
tum mechanics was developed by analogy with classical theory although 
this approach is not followed in this text. 

It is important that the reader understands something of the more 
general formulations of classical mechanics and is familiar with the terms 
Lagrangian, conjugate momentum, Hamiltonian and Poisson bracket. A 
brief review is given in this chapter but for a more complete treatment 
the reader is referred to Principles of Mechanics, by Synge and Griffith, 
or Classical Mechanics, by Goldstein. 



1.2 The Lagrange Equations 

The configuration of a mechanical system is determined when the values 
of a set of 'generalized co-ordinates' are given. The set of co-ordinates 
chosen may be determined by the symmetry of the system. Some or all 
of these generalized co-ordinates can be arbitrarily varied independently 
without violating the constraints of the system and the number of such 
co-ordinates is called the number of degrees of freedom of the system. 
In a holonomic system the number of degrees of freedom is equal to the 
number of co-ordinates. A smooth sphere in contact with a fixed plane 
constitutes a holonomic system, five co-ordinates being necessary to 
define the configuration, each of which may be varied independently. 



2 A FIRST COURSE IN QUANTUM MECHANICS 

On the other hand a rough sphere rolling on a fixed plan forms a non- 
holonomic system as there are now only three degrees of freedom whereas 
the number of co-ordinates is still five. The rest of this chapter is con- 
cerned with holonomic systems only. 




Figure 1.1 A smooth sphere in contact with a fixed plan. The mass centre is 
specified by two co-ordinates (x, y). These angles (Eulerian), specify the orientation 
of the sphere. There are five degrees of freedom. 



Lagrange's equation can be derived from Newton's laws of motion. 
Consider a system composed of N particles of masses mt (/ = 1 , . . ., N). 
At time t the position vector r,- of the particle is determined by the set 
of n generalized co-ordinates qj (/' = 1 , . . ., n), i.e. 

ti = r i {q ll ...,q n ,t) (1.1) 

(for N free point particles n - 3N). If it is assumed that the system has 
fixed constraints (i.e. scleronomic), then r,- does not explicitly involve 
the time. 

The velocity of the particle is then 

The n derivatives qj are called the generalized velocities of the system. 
The particle-velocity f,- is a function of all the variables qj, qj and from (1.2) 

%-'¥-- (L3) 

9(7; dq f 
and 

d /9r,- \ di t 



. la . a - (1.4) 

dr \aqf J oq ; - 

Suppose the particles each undergo a small displacement consistent 
with the constraints of the system such that all the forces remain con- 
stant during the displacement. In this virtual displacement the total work 
done is 

5W = 2F,-.5r,- (1.5) 
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where F,- is the force on the ith particle and Sr,- is its displacement. From 
Newton's laws of motion 

F/ = W/f, (1.6) 

and so 

8W = 2mft.Sr i . (1.7) 

i 

If it is assumed that the internal forces do not work then F,- may be 
identified with the externally applied force. 
Since 

then from 1 .3 and 1 .7 

The total kinetic energy of the system is 

T = h^m i tf (1-9) 

i 

!^-2m*.j^- 0.10) 



and 



iHfH4th4it)- <"» 



But 






d 


UJ 


dr \3^-/ 3<7/ 


and so 






_d_ 
dr 


(=) 


v- - ^r e 

= 2.m i r i . — + - 

/ 3?/ « 



(1.12) 
From (1.8) and (1.12) the work done may be expressed as 
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Generalized forces Q f may be introduced so that (c.f . (1 .5)) 

5W=2<2/H- (1.14) 

and these generalized forces are given in terms of the applied external 
forces. 

As the system is holonomic it is possible to consider a displacement 
involving one degree of freedom only, e.g. 

From (1.13) and (1.14) 
d dT dT 

d7Ur^ =Qi /=1 ' ••••«■ t 1 - 15 ) 

Only conservative forces will be considered and they can be derived 
from a potential function V(q lt . . ., q„) where SW = -5 V, i.e. 

-SV=lQ i 8q,- 
i 

and so 

a=- — • ( i.i6) 

Equation (1.15) becomes 

At dqj dqi dq t 

As the potential function does not depend on the generalized velocities 
it is possible to define a Lagrangian 

£(<7i»-- .,?,i><7i>. ■ ;Qn) = T(q 1 , . . .,q„,q u . . .,q„) 

-V(q u ...,q„) (1.18) 

such that 

d 31 3i „ , 

j-zr. — — = i = l,..., it. (1.19) 

it dq t dq t 

This final result is Lagrange's equations for a conservative holonomic 
system. 

As an almost trivial example consider the motion of a compound 
pendulum. A uniform rod of a length 2a and mass m is free to rotate 
about a horizontal axis through one end. This is a holonomic system 
with one generalized co-ordinate, the angle 6 (see Fig. 1.2). 
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Figure 1.2 Compound pendulum. 

The kinetic energy of the system is T = \l'd 2 where / is the moment of 
inertia about the pivot, i.e. 

T = \ma 2 6 2 . 

With a suitable choice for zero the potential energy is V = - mga cos 0. 
The Lagrangian function is then 

L = \maH 2 + mga cos 

and Lagrange's equation becomes 

%ad +g sin0 =0. 

For small oscillations the motion is simple harmonic with frequency 

„«I./S. 

2it K 4a 



1.3 Hamilton's Equations 

To derive Hamilton's equations it is necessary to generalize the concept 
of momentum. The Lagrangian of a point particle moving in a conserva- 
tive field is 

L=j(x 2 +y 2 +z 2 )-V(x,y,z). 
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The components of linear momentum in the x, y and z directions are 

. oL . dL . dL 

p x =mx= — , P y =my = — , Pz=mz=—-. 
ox by oz 

Accordingly, when the Lagrangian is expressed in terms of a set of 
generalized co-ordinates q^(J = 1, . . ., n) the n components of generalized 
momenta may be defined by 

oL 
Pi=Tr~. (1.20) 

dq t 

Pi is the rnpmentum conjugate to q t . 

For the compound pendulum discussed above 

Pe =%ma 2 6=Id. 

A Hamiltonian function is defined by 

H=2p iQi -L. (1.21) 

i 

From (1 .20) the components of momentum are expressed in terms of the 
q's and q's. This set of equations can be solved, for the ^'s in terms of the 
q's and p's and in this way the Hamiltonian is expressed as a function of 
the #'s and p's, 

H=H(q 1 ,...,q n , Pl ,...,p n ). (1.22) 

It is clear from (1 .2) and (1 .9) that the kinetic energy is a homogeneous 
quadratic function of the q's and so from Euler's theorem 

lq t ^2T. (1-23) 

From (1.18) and (1 .20) p t = bT/dijj and so for a conservative field (1 .21) 
becomes 

H=2T-(T- V) 

= T+V. (1.24) 

The Hamiltonian is equal to the sum of the potential and kinetic 
energies. 

To obtain Hamilton's equations consider a small change in the 
generalized co-ordinates. In terms of differentials 

2bL ^^ dL 

^7^,-2^— d<7,-. (1.25) 
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By definition of the generalized momenta p t = aL/dq t and from 
Lagrange's equations (1.19) 

oL . 
dq t 

Thus 

<lH=I.q i dp i -2pi<ki- 

i i 

As 

H=H(q 1 ,...,q„,Pi,-;Pn) 
then 

= -p.- and — - = q t . 0- 26 ) 

dq, ' *Pi 

These final relations are Hamilton's equations. (Canonical equations.) 

Hamilton's equations form a set of In first order equations in the In 
variables q t and p t and the co-ordinates and momenta enter the theory 
on equal basis. The complete solution of Hamilton's equations involves 
2n arbitrary constants. If the values of all the co-ordinates and momenta 
are known at some instant then the future state of the system is uniquely 
determined by Hamilton's equations. 

As a simple illustration consider a point mass moving under gravity. 
The kinetic energy is 

r = j(i 2 +y 2 +i 2 ) 

and the potential energy is V = mgz where z is the perpendicular height 
from some chosen zero and g is the acceleration due to gravity. 
The Lagrangian is 

L= — (x 2 +y* + z ) - mgz 

and the momenta conjugate to x, y, z are 

az, . 
p x = — = mx, p y =my, p z =mz. 
ox 

The Hamiltonian is 

H =T+V 

2m 
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Note that the Lagrangian is used to relate q t and p t and hence to obtain 
the Hamiltonian. 

A more interesting example is afforded by the rigid rotator. This 
system is composed of two point masses mj and m 2 , a fixed distance 




Figure 1 .3 Rigid rotator. C is mass centre. 

apart, rotating about the mass centre C, and arises in discussing the 
energy levels of diatomic molecules. Let C coincide with the origin of 
the co-ordinate system and let m x , m 2 be at the distances r i , r 2 respect- 
ively from C. This is a holonomic system with two degrees of freedom, 
the polar angles 6 and <p. The potential energy of the system is zero and 
the kinetic energy is 

T = \m x V\ +\m 2 V\ 

where V t , V 2 are the speeds oim l ,m 2 respectively. In terms of the 
polar angles 

T = ^m 1 [rj6 2 +r?sin 2 00 2 ] +%m 2 [r\d 2 +r\ sin 2 d<j> 2 ]. 
The moment of inertia about the mass centre is 

I = mir\ +m 2 r 2 
and the Lagrangian is 

L = T = $I(6 2 +sin 2 00 2 ). 
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The momenta conjugate to 6 and <j> are 

dL ■ dL , • 

and the Hamiltonian of the system is 

Conservation of energy 

The total energy of the system is 

E = T+ V = H(q iy . . .,q„,p u . . .,p„) 
and the rate of change of the energy is 

AE ^c(m . dH . \ 

•spidHM MM 
y>\dq/ dpj dpjdq,- 

= 0. (1.27) 

Throughout the dynamical motion the energy remains constant. This 
result of course was to be expected as only conservative forces have 
been considered (1.16). 

Poisson brackets 

Suppose F and G are two functions of the 2m variables qi,Pi- The Poisson 
bracket of F and G, denoted {F, G} is defined by 

{F, G}=> —- — — . (1.28) 

^\bq f dp, b Pi dq,J 

It is immediately obvious that 

{F G}=-{G,F} 
and 

{F, F}=0. (1-29) 

Simple important cases arise when F and G are qi and /?,-. 

fa/,<7/}=0, {Pi,Pj} = 0, {q h Pj} = t>ij, (1.30) 
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where 5 (/ - is the Kronecker delta defined by 

5, 7 = i#/ 

= 1 /¥=/. 

In classical mechanics Poisson brackets occur when discussing the 
rate of change of functions of the In variables qi and p,-. F is such a 
function and 



dF 
dt 



_v/if!- dF ■ \= V ( dF dH dF dH \ 



i.e. 



dF 

— ={F,H}. (1.31) 

at 

If {F, H} = then Fis a constant of the motion. 

There is a correspondence between Poisson brackets and the com- 
mutators of quantum mechanics. 



1 .4 Time Involved Explicitly 

In the more general case it may be possible to describe a system by a 
Lagrangian satisfying (1.19) but which explicitly involves the time. The 
conjugate momenta are defined in the usual manner (1.20) and so a 
time-dependent Hamiltonian may be found (1.21). Hamilton's equations 
(1 .26) still follow from the analogue of (1 .25) but in addition there is the 
result 



(1.32) 



(1.33) 



dt ~~ bt ' 
The appropriate generalization of (1 .27) is 

dH = bH 

dt ~ dt 

If F explicitly involves the time then (1 .31) becomes 

A system is conservative if either of the equivalent statements 

is satisfied. 
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An electron in an electromagnetic field 

For some systems it is not always obvious what the precise form of the 
Lagrangian should be. Consider an electron with charge -e in an electro- 
magnetic field with electric vector ^and magnetic induction vector B 
(S.I. units). The force exerted on the electron moving with velocity r is 



F = -e[«f+r AB]. 



(1.36) 



The field may be described by the scalar and vector potentials and A 
with 



B = curl A and 



9A 
-grad0-— . 



(1.37) 



In general these potentials are functions of position and time. These 
equations do not specify and A uniquely and it is convenient to 
impose the condition 



90 
div A + y£ — : 
ot 



0. 



(1.38) 



H and e are the permeability and permittivity of the medium. The 
equation of motion of the electron (mass m) is then 



9A 
mi = F = e grad + e — - er A curl A. 
bt 



(1.39) 



It will now be shown that the Lagrangian that describes the system is 
Z.(r,r,O = i"ir 2 +e0-er.A. (1-40) 



The generalized co-ordinates will be taken to be the cartesian co- 
ordinates of the electron and so, 



91 90 _ . bA) 



dXi dXj 



dxj 



Also 



dXj 



= mXj — eAj. 



(1.41) 



(1.42) 



Lagrange's equations (1.19) become 



90 bA i 
mxi =e- — + e— — 
dXi bt 



— e 



*2 



dA 2 
bxi 



bA 
bxo 



+ *3 



bA-; 
bxi 



bAi 
bx 3 



and similarly for x 2 and x 3 . These equations are simply the components 
of (1.39) so verifying that (1.40) is the correct Lagrangian. 
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The momentum conjugate to x t is from (1.42) 

Pi = mXj - eA t . (1-43) 

The conjugate momentum is not equal to the linear momentum unless 
A = 0, i.e. unless there is no magnetic field. The Hamiltonian is 

H = 2p/<7/ -L=p.i-L 
i 

= (mi — eA) . r — \rrir 1 — e<f> + e'r. A 

= $mf 2 -e<t>. (1.44) 

In terms of conjugate momenta 

H(r, p, = -J- (p + e A) 2 - e<j>. (1 .45) 

Im 

It is a simple matter to confirm that 

If the fields are static then and A do not depend explicitly upon the 
time and consequently neither do L nor # and so the energy is conserved. 
In a purely magnetostatic field = and the speed of the electron is a 
constant of the motion. 

1.5 Hamilton's Principle 



The calculus of variations 

Let A and B be two fixed points in the xy-plane with co-ordinates 
(xl-Vi)* ( x 2>y2) and l et Fbe a functional form of the independent 
variable x, a function y(x) and the derivative y' = dy/dx. For a given 
curve jy = y(x) through A and B the integral 

B 

I=\ F(x,y,y')dx (1.46) 

A 

has a definite value. This integral will in general take different values for 
different curves through the fixed points A and B. It is desired to find 
the curve y =y s (x) which makes / take a stationary value, if this exists. 
Let g(x) be an essentially arbitrary differentiable function such that 

g(x 1 ) = g(x 2 )=0. (1.47) 

Then y (x) =y s (x) + fig(x) where ji is small is the equation of a neigh- 
bouring curve through the points A and B. 
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Figure 1.4 

The integral (1 .46) may be regarded as a differentiable function of 0. 

B 

W = j F(x, y,(x) + &(*), y's(x) + &'(*)) dJC. 

A 

Since this integral is stationary when = then I'(Q) = 0. Differentiating 
under the integral sign gives 

| [g(x)F y (x, y s , y' s ) + g'{x)F y -(x, y s ,y' s )] dx = 0. (1 .48) 

A 

The second term can be integrated by parts and using (1 .47) equation 
(1.48) becomes 



j g(x) F y (x, y s ,y s ) - — F y '(x, y s , y s ) 



dx = 0. 



Since g(x) is arbitrary, the condition for / to take a stationary value is 

(1.49) 



bF jd_ (bF\ 
by ~~ dx [by' J 



This differential equation fory is called the Euler- Lagrange equation. 

The extension of the theory to include n variables y,- (j = 1 , . . ., n) is 
straightforward and the condition that the integral 
B 
I=$F(x,y 1 ,...,y n ,y\,...,y' n )dx (1.50) 

A 
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has a stationary value with respect to variations in the y's is that 

If the identification 
x-*t 

y'j^Qi 

is made, then the above theory shows that the Lagrange equations 



_d_ ( dL\ dL _ 
dt {dq/J dq f 







are equivalent to choosing the trajectories of the co-ordinates #,(r) so 
that 

B 

5 J Lit, q h . . ., q„,q u . . ., q n ) dt = 0. (1.52) 

A 

A represents the fixed initial configuration of the system at time t x , and 
B the fixed final configuration at t 2 . This is Hamilton's principle. Of 
course for conservative systems the Lagrangian does not depend explicitly 
on the time and the variational principle determining the equations of 
motion (1.19) is 
B 
5 j (r- V)dt = 0. (1.53) 

A 

Generally the stationary value of the integral is a minimum. 



1 .6 The Hamilton-J acobi Equation 

The Hamilton-Jacobi equation is of special interest since the quantum 
mechanical Schrodinger equation includes the classical Hamilton-Jacobi 
equation as a limiting case. 
Define the action integral 
B 
S= ^L(t,q l ,...,q n ,q 1 ,...,q n )dt (1.54) 

A 

Suppose the integral (1 .54) is evaluated along the path in the (w + 1) 
dimensional space defined by q lt . . .,q„,t representing the natural 
motion of the system from a point A to a point B. This is the unique 
stationary path and the value of S must only depend upon the values of 
the co-ordinates and time at A and B. 
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(1.55) 



Consider now a neighbouring stationary path from the same initial 
point A to a new end point Bj. This is possible since the velocity com- 
ponents have not been defined at A. Let 8q iB , i = 1, . . ., " and Sr B be the 
displacement of B! from B. The change in 5 can be expressed in terms 
of these increments. Only an outline of the argument is given. (For 
further details see the references at the end of the chapter.) 

With the aid of (1.21), the integral (1 .54) becomes 

5 = J (ipi (ten - H(q u . . ., q„,q u ■ ■ ■> Qn, drj. 

After some manipulation, the variation in 5 is 
55 = ZPiB^QiB - Hv§tn 

i 

+ j (2 6p,- dq, - 2 §<7," dp, -8Hdt + 8tdHJ. (1 .56) 

Note that there is no reason to take 55 = (c.f. equation 1 .52) since 55 
is the variation in 5 between two stationary paths. The variation in H is 
given by 






8t 



(1.57) 



and so 



55 = 2 Pib§<7»b - Hft8t h 



■/H(*-S 



(1.58) 

As the integral is taken along a stationary path representing a natural 
motion of the system, Hamilton's equations (1 .26) are satisfied (also 
(1.33)) and so the integral is zero. 



55 = 2p/b -#bS*e 



(1.59) 



Replace q iB and f B by q { and t to represent any point on the path. 
As the increments are independent 

S = S{ qi ,...,q n ,t) (1-60) 

with 



95 
dq, 



■=Pi 
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and 










35 


-,Rn,Pl, ■ ■ ■ 


Pn.O- 


Combining these two 


equations 






as „/ 
37 + ^K-- 


35 

> *7«» ^ > • • • 

9<7i 


35 



(1.61) 



: 0. (1.62) 

This partial differential equation satisfied by 5 is the Hamilton-Jacobi 
equation. For a conservative system the Hamiltonian does not explicitly 
involve the time and the equation reduces to 

/ 35 35 \ 35 

^••'^•••'3^) = -37 = * (L63) 

where E is the energy of the system. 

For a rigid rotator with moment of inertia / the Hamiltonian is 



and the Hamilton-Jacobi equation is 



2/ 



35\ 2 ^ 1 IdS* 
30 7 + sin 2 0\ 30 



35 
3f 



The solution of a dynamical problem can be reduced to the solution 
of the Hamilton-Jacobi equation but this will not be pursued here. The 
interesting relationships between the Hamilton-Jacobi equation and the 
Schrodinger equation of quantum mechanics is mentioned in chapter 
four. Schrodinger used this equation in the development of wave mech- 
anics. By comparing Hamilton's principle with Fermat's principle an 
analogy is obtained relating particle trajectories to rays and Hamilton's 
function 5 to the phase of a wave. 



PROBLEMS 

1 A particle mass of m distance r from the origin, is moving in a plane 
under an attractive force jum/r 2 directed towards the origin. Write down 
the kinetic and potential energies and verify that Lagrange's equations 
may be written 



"-5 1 -* "•£** 



mf-mrd 2 +^- = 0, m— (r 2 d) = 0. 



2 A double pendulum consists of two uniform rods AB and BC each of 
mass m and length / hinged together at B. The end A is hinged at a fixed 
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point so that the rods can ostillate in a vertical plane. The configuration 
of the system is specified by the angles and (/> between the rods AB, BC 
and the vertical. Write down the kinetic and potential energies and 
obtain the two Lagrange equations. Assuming the oscillations are small 
find the solutions of Lagrange's equations of the form 

6=A l e i " t <l>=A 2 e iwt 

where Xl^ 2 are constants. Confirm that the two normal frequencies 



1 V%? , 2 V& 

">-&TT) and W2= 7^7i^ 



where g is the acceleration due to gravity. 

3 Consider a system which moves according to equation (1.15) 



df \9<7i/ &?i 



Integrate this equation with respect to time from t = to t = r. Proceed 
to the limit t -»• and assume that the generalized force Q t -*■ °° in such 
a manner that the impulsive force 

T 

I t = lim [ Qi At 
is finite. Using L = T — V show that 

That is, the change in the generalized momentum is equal to the 
generalized impulsive force. 

4 A particle of mass m undergoes simple harmonic oscillations along 
the x-axis. The controlling force is directed towards the origin and has 
value — kx. Write down the kinetic and potential energies and hence the 
Lagrangian. Find the momentum conjugate to x and show that the 
Hamiltonian of the system is 

H = — p 2 +^rkx 2 . 
2m 2 

5 A two dimensional harmonic oscillator has kinetic energy !Tand 
potential energy V given by 
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Derive Lagrange's equations of motion and find the conjugate momenta 
Pi,P2- Show that the Hamiltonian of the system is 

H{p, <l)=J^ [P 2 i + P2 +m 2 co\qj + q 2 )]. 

New co-ordinates Q lt Q 2 with momenta P\,P 2 are defined by the 
transformation 

rr~ r~T~ 

1i = ./ • v^i sin Q 1 Q2= J \fP~2 sin Q 2 

pi =^/2mu>yJFi cos Qi pi = yJlmuyJFz cos Q 2 . 

Confirm that the transformed Hamiltonian is 

/r' = co(/ > 1 + P 2 ). 

A necessary condition for this transformation to be canonical is that 
Hamilton's equations (1 .26) are satisfied for this new Hamiltonian. 
Show that these equations are 

P t = and <2i = ^ 1=1,2 

and verify that they are indeed satisfied. 

6 Prove the following properties of Poisson brackets 
(i) {E + F, G} = {E, G] + {F,G} 
(ii) {F, {G, K}} + {G, {K, F}} + {K, {F, G}} = 

where E, F, G, K are all functions of q t and p t . 

1 Suppose A, B are two fixed points in space, A higher than B and 
choose the two co-ordinate axes so that both points lie in the (x, y) 
plane. Let the co-ordinates of A and B be (x l ,y 1 ), (x 2 ,y2) respectively. 
y =y(x) is the equation of a curve joining A and B. A smooth particle 
starts from rest at A and falls under gravity down the curve. 
Show that the time taken to reach B is 



r- l f / 1 + y' 2 .1 u > d y 

T-— — dx where y =— . 

Show that T is a minimum when the curve joining A and B satisfies 

(1 +y' 2 )(yx - y) = constant. 
Introduce the parameter \p with y = tan i// and derive the solution 

y - y\ = A(l + cos 2i//) 

x = B - A(2i// + sin 2i//). 
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These are the parametric equations of a cycloid. (This is the famous 
brachistochrone problem). 

8 The Hamiltonian for a particle of mass m moving vertically under 
gravity is 

1 , 
H (x, p) = y~P ' + mgx 

where x is the height of the particle above the zero for the potential 
energy. 

Obtain the Hamiltonian-Jacobi equation. Verify that 



FT 2 

S = -a 1 t+ J— — («i -mgx) 312 +a 2 
V m 3g 



m 3g 

is a solution where aj and a 2 are constants. Using equation (1.61) 
explain the physical significance of the constant a x and find the 
momentum as a function of x. 
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CHAPTER 2 



Origins of the Quantum 
Theory 



2. 1 Failure of Classical Mechanics 

The classical mechanics formulated by Newton and developed by 
Lagrange, Hamilton and others, is very successful in explaining the 
dynamical motion of macroscopic objects and before the beginning of 
this century was universally accepted as being the ultimate theory of 
dynamics. 

A similarly satisfactory state of affairs existed concerning the theory 
of light. The work of the Frenchman A. J. Fresnel had led to the accept- 
ance of the wave nature of light. Later the Scotsman Clerk Maxwell 
developed the electromagnetic theory of light and in 1887 the German 
physicist Hertz confirmed experimentally that electromagnetic waves 
are emitted by oscillatory electric charges. 

However, by the end of the nineteenth century these classical theories 
were unable adequately to explain a growing number of small-scale 
(atomic) phenomena. Physicists were beginning to explore the structure 
of matter and the nature of radiation and the interaction of radiation 
with matter. As a consequence, the foundations of physics were radically 
re-examined and the first quarter of the present century was to see the 
development of the theories of relativity and quantum mechanics. This 
chapter attempts to trace the history of the more obvious failures of 
classical mechanics and how they were overcome, firstly by Planck, 
Einstein and Bohr, and later by Schrodinger and others. 
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An obvious break -down in classical theory occurred in the attempts to 
account for the continuous spectrum emitted by a black -body. A black- 
body is defined as one that absorbs all incident electromagnetic radiation. 



ew 




Rayleigh-Jeans 

Experimental 



= T r 



T 2 >T, 



r= r, 



Figure 2.1 Radiation density from a black-body radiator. 



It follows from thermodynamics that such a body is a better radiator at 
every frequency that any other body at the same temperature. The 
frequency distribution of the spectrum depends only on the temperature 
and not on the material of the black-body. A true black-body does not 
exist but it is clear that the radiation emitted from a small opening of a 
hollow enclosure is equivalent to black-body radiation. 

In 1879 Stefan suggested that the rate of heat loss from a hot body is 
proportional to I 4 where T is the absolute temperature and Boltzmann, 
using classical thermodynamics was able to derive the result 

c 

where E is the total radiation density from all frequencies and a is a 
constant; c is the velocity of light. Wien in 1893 also used a classical argu- 
ment to show that the wave-length, which is emitted with maximum 
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intensity from a black-body, is proportional to l/7\ This is the Wien 
displacement law and is in agreement with experiment. 

So far classical theory was adequate. However, there was no theory 
to explain the observed frequency distribution of the black-body 
radiator (Fig. 2.1). Wien proposed a semi-empirical theory which agreed 
in the short-wave limit while Lord Rayleigh followed by James Jeans 
gave a theoretical result based on classical reasoning which agreed at long 
wave-lengths but was in complete disagreement for short wave-lengths. 
The essentials of their argument are given below. 

The diffraction experiments of Young in 1803 and the Maxwell theory 
of electromagnetic radiation had led to the acceptance of the wave nature 
of light. Rayleigh and Jeans used wave theory together with the classical 
law of equipartition of energy to obtain the Rayleigh-Jeans formula. 
Following Hertz, the emission and absorption of radiation was taken to 
be due to oscillatory charges within the black-body. It was assumed that 
there are a great number of linear harmonic oscillators with all possible 
frequencies in equilibrium at the temperature in question. 

Consider such a system. Let E(p)Su be the density of radiation energy 
per unit volume with frequencies between v and v + 5v. It can be shown 
{Classical Electricity and Magnetism, p. 336 by Panofsky and Phillips) 
using classical electromagnetic theory that when the oscillators are in 
equilibrium with the radiation in a black-body cavity 

r,, x 87TP 2 , , 

E(v)=-^-u(y) (2.1) 

where u(y) is the average energy of the oscillators with frequency v and 
c is the velocity of light. 

The classical 'Law of Equipartition of Energy' states that the mean 
energy of the oscillators with frequency v is 

u(y) = kT (2.2) 

where k is Boltzmann's constant and T is the absolute temperature. From 
(2.1) and (2.2) 

E(p)Sv = -^- kT Sv 
or 

S-TT 

E(X)8\=- i kT8\. (2.3) 

A 

This is the Rayleigh-Jeans law and it implies that the energy radiated 
in a given wave-length range 5\ increases without bound as A becomes 
smaller. This non-physical result is of course experimentally incorrect 
(Fig. 2.1) and the Rayleigh-Jeans law is in agreement with experiment 
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only for long wave-lengths. Also, equation (2.3) implies that the total 



energy emission is 

c 



E = | £(v) dp = 4 kT J v 2 dp (2-4) 



and clearly E is infinite. The Rayleigh-Jeans law demands that the total 
energy radiated per unit time is infinite at all temperatures except T - 0. 
This obviously false result is often referred to as the ultra-violet 
catastrophe. 

Clearly there was a fundamental error in classical theory. 



2.2 Origin of the Quantum Theory 

Max Planck 

In 1900, before the publication of the Rayleigh-Jeans law, the German 
physicist Max Planck discovered an empirical relation that agreed with 
the entire continuous spectrum emitted by a black-body radiator. He 
then worked to find a physical explanation for his formula and succeeded 
in doing this after several weeks work. His results, published in 1901, were 
the origin of the quantum theory. The first quarter of the present cen- 
tury saw the subsequent development of the 'Old Quantum Theory' 
which was to be superceded in 1925 by the 'wave mechanics'. 

Planck also assumed that a black-body is composed of oscillators in 
equilibrium with the radiation field and that equation (2.1) holds true. 
He realized that some drastic change was necessary to obtain a fit to 
the experimental data and his basic assumption was that the material 
oscillators can only have discrete energy levels rather than a continuous 
range of energies that had been assumed before. In particular Planck 
assumed that an oscillator with frequency v can only take the values 

e n =nhv n = 0, 1,2, ... (2.5) 

where h is a universal constant, later to be known as the Planck constant. 
Planck chose the value h = 6-55 x 10" 34 joule sec (units of action) to 
agree with the experimental results but the value accepted today is 

h = 6-624 x 10~ 34 joule sec. (2.6) 

Note that the difference between consecutive energy levels of a Planck 
oscillator is an extremely small 'quantum of energy'. 

Planck did not introduce the concept of the photon. He merely 
suggested that matter absorbed or emitted radiation energy in discrete 
amounts, (i.e. multiples of hv). 

The argument below is not in the form given by Planck who used a 
thermodynamic derivation. Consider the jV„ oscillators with frequency v. 
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In equilibrium at temperature T the number of oscillators with energy e„ 
is, from the classical Boltzmann expression, equal to 



N v e-*n' kT 2 e-*"' kT . (2.7) 



rt = 



The mean energy per oscillator of frequency v is 



u{v) = I e n e-*n' kT 2 e-*»' kT . (2.8) 

n = / n = 

Substitution of equation (2.5) gives 

u(y) = hvl(e HvlkT -\). (2.9) 

The Planck expression for the density of radiation is obtained from 
(2.1) and (2.9) to be 



87TP 2 hv 

(e hv < kT -\y 



E ly) = — < e hv/KT_ ^ ■ (2.10) 



Planck's law (above) agrees very closely with the observed radiation 
distribution when h is chosen correctly (see (2.6)). In the long wave- 
length limit 

kT 

and Planck's expression reduces to the Rayleigh-Jeans. The total radia- 
tion density arising from all frequencies can be obtained using (2.10) 
and 

E=]e(?)Av^ 5 ^T\ (2.11) 

The energy density is proportional to the fourth power of the absolute 
temperature. This result was first suggested by Stefan. 

Albert Einstein— The photo-electric effect 

In 1905, Albert Einstein an employee of the Swiss patent office, took the 
next important step in the development of the quantum theory. Planck 
had suggested that oscillators absorb or emit radiation in energy bundles 
but that in free space the radiation obeys the continuous laws of Maxwell. 
Einstein pointed out that these ideas are not compatible and suggested 
that electromagnetic radiation only exists in discrete energy bundles later 
to be called photons. This was a step that Planck had hesitated to take as 
it opened the old wave-particle dialogue about the nature of light. The 
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first major success in the new theory was Einstein's explanation of the 
photo-electric effect. 

In 1887, Hertz in his 'hoop' experiments confirmed the existence of 
the electromagnetic waves predicted by Maxwell. During the same 
experiment he noticed that when light flashes from the transmitting 
circuit shone on the ends of his detector 'hoop', the weak sparks in the 
gap passed more easily. The Hertz effect was explained by Lenard who 
showed that a negatively -charged metal loses its charge when exposed to 
ultra-violet light because the electrons gain energy from the radiation and 
are emitted. This is now known as the photo-electric effect. Further 
experimental investigation of this effect showed that the number of 
electrons emitted increases when the intensity of light is increased but 
that the maximum observed speed of the electrons only increases if the 
frequency of the light is increased. This final result cannot be explained 
by the classical Maxwell theory which clearly demands that the electron 
speed should depend upon the light intensity. Einstein gives the 
explanation in terms of photons. 

Einstein looked upon the photo-electric problem as one of particle 
collisions. He looked upon a monochromatic beam of light as being 
composed of 'particles' called photons, each of which has an energy hv 
where v is the frequency of the light. 



I 

Metal % 
surface Y/ 



Electrons 



Light 



Figure 2.2 

Essentially, Einstein's argument was as follows. When an incident 
photon collides with an electron in a metal surface, the energy of the 
photon is transferred to the electron and when the electron is emitted 
from the metal surface it has a kinetic energy given by 



i 



mv 



2 _ 



hv-P 



(2.12) 
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Figure 2.3 Dependence of stopping potential on incident light frequency. 

where \ mv 2 is the electron kinetic energy and P is the work required to 
extract the electron from the metal. P varies from metal to metal and is 
of the order of a few electron volts. 

The result (2.12) was verified by Millikan in 1916 in a series of 
experiments using sodium and potassium as the photo-electric surface. 
A stopping particle V may be defined by 

V Q e =%mv 2 . 
V is the applied potential required to prevent any emission of electrons 
and the experiment graph of eV against incident light frequency is 
linear as predicted by (2.12). The slope is equal to Planck's constant. 



Specific heat of solids 

In 1906 Einstein applied quantum ideas to the theory of the specific 
heat of solids. 

Classical theory predicts that the heat capacity at constant volume is 
independent of temperature. Consider a solid composed of N identical 
atoms each of which can vibrate about its equilibrium position. The 
classical 'Law of Equipartition of Energy' demands that the mean energy 
of such a three-dimensional oscillator in equilibrium at absolute tempera- 
ture T is 3kT and the mean energy of the solid is 

(2.13) 



u = 3NkT. 

For one mole, N = 6-025 x 10 23 (Avogadro's number) and 
u = 3RT 



(2.14) 
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where R is the gas constant. The lattice contribution to the molar heat 

capacity is 



C v = ( — } = 3R ~ 6 cal/deg mole. 



(2.15) 



Classical theory predicts that the heat capacity per mole is the same for 
all substances at all temperatures. At room temperatures and above many 
solids obey this law but agreement fails entirely at low temperatures. 
Experimentally it is found that the heat capacity approaches zero as 

Einstein, following Planck, assumed a solid can be represented by a 
collection of harmonic oscillators which can only take discrete energy 
values nhv, n integer, where v is the oscillator frequency. Einstein con- 
sidered the simplest model and assumed all the oscillators have the same 
frequency v . 

The internal energy of the solid is then 



3hv 



u - N {e hvjkT^ l y 
The lattice heat capacity at constant volume is 






3Nk 



JeIT 



B_e_ 



(2.16) 



(2.17) 



where the Einstein temperature E is defined by hv = kd E . The Einstein 
temperature for a material may be chosen to make equation (2.17) give 



Einstein 



— • — Experimental 
Classical 




Figure 2.4 Molai heat capacity. 
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a reasonable fit to the experimental results and is of the order of a few 
hundred degrees absolute. 

Einstein's formula is in fair agreement with experiment for T> 6 E 
but at low temperatures when T< 6 E the agreement is poor. For non- 
metals, the heat capacity approaches zero as T 3 whereas Einstein's 
formula predicts 

c v =3Nk\^ e-'Bir T<dE . (21g) 

Debye (1912) successfully explained the T 3 variation at low tempera- 
tures by considering a spectrum of frequencies to be present. 



2.3 The Hydrogen Atom 

Experimental spectroscopy developed rapidly during the second half of 
the nineteenth century. Because the emission of light was thought to be 
the result of vibrations, harmonic relations between the spectrum lines 
were looked for without success. However in 1885 Johann Balmer dis- 
covered that the wavelengths of all the spectrum lines of the hydrogen 
atom then known could be expressed by the formula 

A„ = 2 _ 4 b « = 3, 4, 5, . . . 

where b is a constant. This equation is generally written 



2« X n "\2 2 i?) < 2 - 19 > 

where k n is the wave-number and R is the Rydberg constant with 

R = 1-097 xl0 7 m _1 . 

The formula (2.19) gives excellent agreement with experimental results. 

Balmer and Ritz suggested that other spectral series might exist for 
the hydrogen atom with the first term different from 1/2 2 . These have 
of course been found and the general expression for the wave-length of a 
hydrogen atom spectrum line is 

k _ 1 / 1 1 \ 

2n~\~ [m* ~^*f n>m ' each an ****&*■ ( 2 - 2 °) 

Lyman found the 'ultra-violet' series with m= l,n=2,3, 4, ... and 
Paschen discovered the 'infra-red' series with m = 3, n = 4, 5, 6 
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From (2.20), the wave number of any line of the hydrogen atom spectrum 
is the difference between members of the series of 'terms' 

f„=4 » = 1.2 < 2 - 2l > 

n 

The spectrum lines of other elements can also be obtained as the 
difference of two terms, i.e. 

t-{-"-'<- (2 ' 22) 

However the terms t t , tj generally are more complicated than those used 
for the hydrogen spectrum. This is the Ritz Combination principle. 
Although these empirical formulae were very useful they had no theoretical 
explanation. 

In 191 1 Ernest Rutherford suggested the 'nuclear atom'. From his 
experiments on the scattering of a-particles Rutherford was lead to 
suggest that an atom consists of a small positively charged nucleus 
surrounded by outer electrons. (The electron had been established by 
J. J. Thomson in 1897.) 

Unfortunately classical deductions from Rutherford's 'nuclear atom' 
do not explain the observed atomic spectra. No stationary configuration 
of charges can be in stable equilibrium under their own electrostatic 
forces (Earnshaw's theorem) and so the electrons must be in motion 
relative to the nucleus. Such electrons must orbit in some way and the 
corresponding acceleration will result in energy loss by electromagnetic 
radiation. Hence electrons will gradually spiral into the nucleus. More- 
over the frequency of the radiation emitted will gradually increase as 
this spiralling continues and a continuous spectrum will be produced 
instead of the observed discrete spectral lines. 



Niels Bohr 

In 1913 the Danish physicist Niels Bohr combined the 'nuclear atom' 
concept of Rutherford with Planck's quantum hypothesis to give a 
theory that accounted for the observed spectrum of hydrogen. Bohr 
made two basic postulates. 

(i) He assumed that the electron is a particle that can revolve about 
the nucleus only in certain allowed orbits which may be circular 
or elliptical. Each orbit represents a stationary energy state in 
which electromagnetic radiation is not emitted, in conflict with 
the predictions of classical theory. 

The allowed orbits or states are those in which the angular 
momentum of the electron about the nucleus is an integral 
multiple of ti = h/lir. 
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(ii) Emission or absorption of radiation occurs when the electron 
makes a quantum jump from one stationary state to another. 
When the atom makes a transition from a higher energy state 
Ei to a lower energy state Ef Bohr suggested, (following Planck), 
that the energy difference is radiated as a photon of frequency 
v with 

E t -E f = hv. (2.23) 

An outline of the Bohr theory is given below. 

Bohr used classical methods to calculate the energy of his allowed 
stationary states. The force of attraction between the hydrogen nucleus 
and the electron is 

e2 (2.24) 



47re r 2 



where r is the distance of the electron from the nucleus and e is the 
permittivity of free space. The electron must move around the nucleus 
in an orbit that is a conic section. The simplest case is that of a circle 
and the inward acceleration is 

™H! (2.25) 

r 

where m is the electron mass and v is the speed. From Newton's second 
law and equations (2.24) and (2.25) above, the kinetic energy is seen to be 

r = ! mi; 2 = _£l (2.26) 

2 87re r 

The potential energy of the electron is given by 

4ne r 
and so the total energy of the electron in a circular orbit of radius r is 

E=T+v = ^-_ (2.27) 

87re r 

The zero of energy is taken to be the ionized atom where r = °°. 

From Bohr's first postulate the radii of the allowed orbits are given by 

mvr = nh n = l,2,3, .... (2.28) 

From (2.26) and (2.28) it is possible to eliminate t; to give 



r = - 



n 2 fi 2 A-ne 



me 2 



(2.29) 
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and the energies of the allowed states are 



E„=- 



me 



2{Aite Q yfi i n 



-y n = 1,2, . 
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(2.30) 



When an electron jumps from the «,- orbit to the nf orbit the 
frequency of the radiation emitted is given by 

Et-Ef 



i.e. 



27T 2 me 4 



v = - 



1 



«} 



n} 



(47re ) 2 ^ 3 
The wave number of the radiation is given by 



(2.31) 



k_ 

2tt 



1_. 



and so 



Jc_ 
2tt 



2-n 2 me 4 






1 



(47re ) 2 fc 3 c 
This result agrees with the formula (2.20) if the Rydberg constant 



R 



27r 2 me 4 
(47re ) 2 ^ 3 c 



(2.32) 



The value of R calculated from (2.32) using the accepted values of m, e, 
c and h agrees excellently with the experimental value. This theory was 
a mixture of classical and quantum concepts. Bohr assumed that classical 
theory could be used to calculate the energies of the states but used his 
own quantum argument to determine the states allowed. 

Bohr's theory dealt only with circular orbits. A. Sommerfeld (1916) 
(see also Wilson 1915), generalized the Bohr theory to include elliptic 
orbits. He said that the allowed states of a periodic system are determined 
by the quantum conditions 

(2.33) 



<j> pi dq ( = n t h n t integer 



where p t is the momentum conjugate to the co-ordinate q t and the 
integration is over a complete period of the motion. 

Consider an electron revolving about the hydrogen nucleus in a plane 
elliptic orbit, with the nucleus at one focus. The single Bohr condition 
(2.28) is replaced by the two conditions 

(2.34) 



^p d0 = W0/i §p r dr = n r h 
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Figure 2.5 Elliptic electron orbit. 



where n$, n r are both integers. The momentum p^ is simply the angular 
momentum and from Kepler's law is a constant. 

p^ln = n^h or p^ = n$ti 



assumed by Bohr, n^ is called the angular or azimuthal quantum 
imber. n r is the radial quantum number and the principle quantum 



as 

number, n, 

number is defined by 



n=n r +n,p. (2.35) 

It can be shown that Bohr's energy expression (2.30) holds for elliptic 
orbits where the n in the formula is now the principle quantum number. 
The introduction of elliptic orbits does not produce any new levels for 
the hydrogen atom although more than one state (orbit) may have the 
same energy. However, when the relativistic mass variation of the electron 
is included it is found that this 'degeneracy' is removed and that the 
energy depends upon both n and n^. Of course, a particle electron 
possesses three degrees of freedom and it is to be expected that a third 
quantum number should be necessary to define a state completely. This 
is the magnetic quantum number. 

Simple harmonic oscillator 

When the Sommerfeld rule is applied to the linear harmonic oscillator the 
energy levels used by Planck are obtained. Consider a particle of mass m 
oscillating about the origin in the x-direction. The energy E of the system 
is given by 

2 

E = — + 2n 2 v 2 mx 2 
2m 

where p is the momentum conjugate to x and v is the frequency. The 
Sommerfeld 'action' integral (2.33) becomes 



Lp{x)Ax=2m — -4ir 2 u 2 x 2 

— a L 



1/2 

dx = nh n integer 
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where 



2E_ 
V m 1 


jlTJV. 


This integral can 


be evaluated by substituting 


l2E 
x = / — 
v m 


sin 0/2irv 


to give 




E . 
— = nh 

V 




i.e. 




E = nhv. 




This is equation (2.5). 



Bohr correspondence principle 

The Bohr theory of hydrogen atom even when developed using the 
Sommerfeld condition was clearly not complete. It did give an expla- 
nation of the Zeeman and Stark effects, but unfortunately the Bohr 
theory predicted too many spectrum lines. Some type of 'selection 
rules' had to be imposed. Also, although it was obvious that different 
spectral lines had different intensities the Bohr theory had no expla- 
nation of this. In 1918 Bohr added another rule to his theory which was 
to prove very successful in overcoming the difficulties mentioned above. 
This was the correspondence principle. 

In his original paper (1913) Bohr reasoned that in the larger atomic 
orbits, where the energy jumps are smoothed out, the quantum theory 
must give the same results as classical theory. Consider an electron jump 
from a hydrogen state with n t = N + 1 to a final state n f = N. From (2.30) 
the frequency of the emitted radiation is 



27r 2 me 4 



_L 1 

N 2 ~(N+l) 2 



(47re ) 2 fc 3 

In the large quantum number limit N -*■ °° this frequency becomes 
47r 2 me 4 1 



v = 



(4ire ) 2 fc 3 iV 3 



(2.36) 



Now consider the frequency calculated from classical electromagnetic 
theory. Gassically the emission frequency is equal to the frequency of 
revolution of the electron in the circular orbit. (Electrons in elliptic 
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orbits also emit frequencies which are integral multiples of the fundamen- 
tal frequency.) That is, the classical frequency is 

v = v/2irr 

where v is the electron velocity and r is the orbit radius. On substituting 
for v and r using (2.28) and (2.29) it is easily seen that the classical 
frequency is equal to the limiting quantum frequency (2.36). In the 
case where the energy change associated with a transition is small com- 
pared with the energies of the initial and final states, the quantum 
description approaches the classical description. 

In 1918 Bohr extended the original form of the correspondence 
princple and applied it to large energy jumps and managed to obtain 
rules for calculating intensities. Although it proved quite successful it 
was an unsatisfactory mixture of quantum and classical theory and will 
not be considered further. 



2.4 Failure of the Old Quantum Theory 

There were obvious weaknesses in the 'old quantum theory' of Bohr and 
Sommerfeld. It was seen from the beginning to be a mixture of quantum 
and classical ideas. The electron was regarded as a point charge and the 
Sommerfeld rules picked out the allowed orbits which had energies cal- 
culated using the classical theory. However, even though classical theory 
demanded that the accelerating electrons must emit radiation, Bohr 
postulated that this did not happen for electrons in a stationary orbit. 
Also, non-periodic phenomena were not included. There was no quantum 
theory of a-particle emission. 

Although the Bohr-Sommerfeld theory allowed only integral quan- 
tum numbers it was discovered that certain systems demanded half- 
integral quantum numbers. Such an example was the rotation spectra of 
the hydrogen halides molecules. The explanation of this requires a spin 
structure for the 'particle electron'. In some cases quantitative disagree- 
ment occurred between the old quantum theory and experiment (e.g. the 
normal state of the helium atom) and in others even qualitative agreement 
was not even achieved. Disatisfaction with the old quantum theory even- 
tually lead to the discovery of the matrix mechanics of Heisenberg and 
the equivalent wave mechanics of Schrodinger (1926). 



2.5 Wave-Particle Dualism 



Dual nature of light 

By the end of the nineteenth century, the wave theory of light was 
firmly established on the basis of interference and diffraction experiments 
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and the success of the classical electromagnetic theory. This view was 
challenged by Einstein in his theory of the photoelectric effect in which 
energy is transferred to the electrons in quanta of energy hv called 
photons. Further evidence concerning the dual nature of light was given 
by the production of X-rays. 



Electron 
stream 




X-ray 



Photo sensitive 



Ejected 
surface \ electron 



Figure 2.6 

Roentgen discovered X-rays in 1895 and the wave nature of X-rays 
was established by von Laue in 1912 by his diffraction experiments. 
X-ray production may be regarded as the inverse of the photoelectric 
effect and is brought about by directing a stream of electrons onto the 
anti-cathode in an X-ray tube. If the X-rays produced are then used in a 
photo-electric experiment to produce electrons it is found that the maxi- 
mum energy of a photo-electron is equal to the energy of one of the 
electrons in the stream which originally produced the X-rays. This implies 
that the X-rays transport their energy across space in photon bundles 
without any dissipation. 

In 1922-23 Compton took the photon concept a step further when he 
associated a momentum with the photon in his explanation of the 
scattering of hard X-rays. 

Compton effect 

The American physicist Arthur Compton discovered that when hard 
(short wave-length) X-rays are scattered by atoms of low atomic number 
(graphite) the scattered radiation includes not only the original wave- 
length but also softer X-rays of longer wave-length. This is contrary to 
the classical theory which demands that the scattering electrons vibrate 
with the frequency of the incident radiation and so emit radiation of 
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exactly this frequency. However Compton was able to explain this 
phenomenon by assuming that the X-rays consist of a collection of 
photon particles each having an energy hv and also a momentum hv/c 
where c is the velocity of light. 



Scattered photon 
Energy A/ 



Incident photon 
Energy hy 



Figure 2.7 Compton scattering. 




y > y 



Recoil electron 



Consider an incident photon of frequency v which collides with an 
electron and is then scattered. For materials of low atomic number the 
energy of the hard X-ray photon will be very large compared with the 
electron binding energy and so the electron may be considered to be 
free. During the collision, energy is conserved and so 



hv=hv +T 



(2.37) 



where T is the kinetic energy of recoil of the electron and v is the 
frequency of the scattered photon. From special relativity theory 



T = m c 2 



1 



-(;) 



(2.38) 



where m is the electron rest mass and i; is the speed of recoil. Applying 
the low of conservation of momentum 

hv hv m . 

— = — cos + — , i> cos 

c c 



J^) 



and 



hv' 

= — sin i 

c 



m r 



M3 



v sin 6. 



(2.39) 
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There are three equations involving the three unknowns v', v and and 
on solving it is found that the increase in wave-length is 

5X = X '_ X =^L sin 2i (2.40) 

m c 2 

where X and X' are the wave-lengths of the incident and scattered X-rays 
respectively. Note that 6\ is constant for a given scattering angle <t> and 
so the percentage increase in wave-length is greater for small wave-lengths. 
This formula is in quite good agreement with experiment. 

When the frequency of the incident radiation is reduced, the binding 
energy of the electron cannot be ignored and instead of Compton scat- 
tering the photoelectric effect takes place or the X-ray undergoes Laue 
scattering. 

The work of Einstein and Compton when taken in conjugation with 
the work on interference and diffraction leads to the paradoxical result 
that electromagnetic radiation has a dual nature. Sometimes it behaves 
with wave-like properties and sometimes with particle-like properties. 
The attempts made to understand this paradox have resulted in the new 
quantum theory. 



Dual nature of matter 

In 1897 the English physicist J. J. Thomson succeeded in deflecting 
cathode rays by an electrostatic field and proved conclusively that the 
rays consist of material particles with a negative charge. For the next 
quarter century no doubt was expressed as to the particle nature of the 
electron. 

During 1922-23 the Frenchman Louis de Broglie suggested that not 
only does radiation have a dual nature but material particles also require 
a wave-particle description. He was primarily interested in the nature of 
light and on the basis of Einstein's relativity theory he was convinced of 
the dual wave-particle nature of light. 

From special relatively, the relation between the total energy E of a 
particle in field free space and its momentum p and rest mass m is 

fEV =p*+mlc 2 . (2-41) 

Taking the rest mass of a photon as zero 
(— ) =p 2 or E=pc. 

This final result is consistent with the Einstein formula 
E=hv 
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and the Compton result 

p=h/K. 

In relativity both radiation and matter appear as forms of energy. L. de 
Boglie suggested that any moving particle has associated with it a group 
of waves whose wave-length X depends on the momentum of the particle 
according to the equation 

p=h/\. (2.42) 

This is the equation used by Compton for photons. The wave-length 
associated with large bodies (e.g. a billiard ball) moving at low speeds is 
far too small to be detected experimentally. However, the wave-length 
becomes appreciable for atomic particles. An electron with a 10 eV 
kinetic energy has a wave-length of 5 - 3 x 10 -8 cm (5 A). 

De Broglie noticed correspondences between the classical theory of 
light and the classical theory of mechanics which help to suggest equation 
(2.42). The French mathematician Fermat had reduced the laws of geo- 
metrical optics to the principles of 'least -time'. That is, a light ray follows 
the path requiring the least time. The time taken for a light ray to travel 
from a point A to a point B is given by the line integral 



I 



1 d S 



V p (x, y, z) 



where V p is the phase velocity in the medium and is a function of 
position in general. As 

_L -I I 

V p " v X X 

where v is the constant frequency, Fermat's principle may be written 

B 

6 f l ds = 0. (2.43) 

J \(x,y,z) 

A 

There is a corresponding principle in classical mechanics. This is 
Hamilton's principle (1.52) which states 

B 

5 \ Ldt = 0. 

A 

For a single particle of constant total energy E and kinetic energy T then 
L=2T-E as E = T+V 



ORIGINS OF THE QUANTUM THEORY 39 

and so 

B 

5 f 2Tit = 0. 

A 

By changing the variable of integration from the time to distance it is not 
difficult to show that this final result can be written 

B 

8 f p(x,y,z)ds = (2.44) 

A 

and the similarity between (2.43) and (2.44) is striking. The relation 
p = constant /X 

is immediately suggested. 

This analogy between the variational principles of geometrical optics 
and particle mechanics had of course been noticed much earlier and had 
resulted in the Hamilton-Jacobi equation discussed in Chapter 1 . 
Schrodinger used this equation in his development of the Schrodinger 
wave equation for quantum mechanics but the discussion of the relation 
between the Hamilton-Jacobi and Schrodinger equations is left till 
Chapter 4. 

The first experimental verification of the de Broglie matter waves 
was obtained in 1925 by Davison and Germer who were conducting 
experiments involving the scattering of slow electrons by nickel in a 
vacuum. De Broglie's formula was completely verified by G. P. Thomson 
using a thin film of gold for his scattering material. 

Another aspect of the relation between particles and waves is 
illustrated by considering the concepts of phase and group velocity. 

The phase velocity V p of a de Broglie wave is given by 

V p = i>\ (2.45) 

where v and X are the frequency and wave-length associated with the 
particle. Writing, with the usual notation 

E = hv and p = hfk 
then 

V p =E/p. 

In free space, relativity gives 

"- , m ° p= . m ° v (2.46) 



TRU J^ 



where m is the particle rest mass and v is the particle velocity. 

V D =c 2 /v. (2.47) 
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As energy cannot be transmitted faster than light, v < c and so 



V p >c. 



The de Broglie waves for a free particle travel with a phase velocity 
faster than light. However there is no conflict with relativity as the 
energy of the waves (or particle) is transmitted with the group velocity 
not the phase velocity. This is explained below. 



Group and phase velocities 

The equation of motion of a plane wave of frequency v and wavelength X 
moving in the x-direction is 

/ x 
\p (x, t)=A sin 2ir I vt - — 

where A is the amplitude. This wave moves with a phase velocity 

V p = v\. 

It is possible to form a concentrated wave packet by taking a sum of a 
large number of plane waves with slightly different wave-lengths and 
frequencies. If the medium is dispersive so that plane waves with dif- 
ferent frequencies move with different phase velocities, the wave packet 
constructed moves with a velocity different from the wave velocities. 
This new velocity is called the group velocity. 

The essentials of the argument are obtained by considering the inter- 
ference of two waves of similar but different frequencies and wave-lengths. 
Take 

i/> (x, t)=A [sin (ojt - kx) + sin (co'r - k'x)] 
where k, k denote the wave numbers such that 

k = 2rr/\ and k' = 2tt/X' 
and co, co' denote the angular frequencies given by 

co = 2irv and co = 2nv . 
It follows that 

, , ^ „ . f/ w '- co \ I k' - k 
\p(x, t) = 2A cos — - — t 



x sin 



co+co\ I k +k 

t-\ \x 
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Figure 2.8 Wave packet. 

Put co' = co + 5co and k' = k + 8k and then approximately 



\jj(x, t) ■■ 



, Sco 8k 
2A cosj — t — —x 



sin (cor - fcx). 



(2.48) 



The combined disturbance represents a carrier sine wave with frequency 
v and wave number k with an amplitude that slowly fluctuates so that 



W|A^^^-#iA° 



Figure 2 .9 Gro ups of waves or beats . 

there are groups of waves with gaps in between. The groups of waves 
move with a group velocity 

Sco 
Vg = — where cj = 2vv . 
* 8k 



(2.49) 



In the limit as 8k -*• the group velocity becomes 

V =-^ 
K * 9/fc" 

For the de Broglie waves corresponding to a particle, the total energy 
E and linear momentum p are given by 

E = hv p=fik 

and so the group velocity is 

dE 



Vg dp 



(2.50) 



Trir = 2 P 
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From equation (2.41) 

2EdE_ 
c 2 dp 

and so the group velocity is (from 2.46) 

_ pc 2 vc 2 
g E c 2 

The group velocity ef the de Broglie waves is equal to the corresponding 
particle velocity and so from (2.47) 

V g V p =c 2 . (2.51) 

De Broglie waves and the Bohr atom 

Consider the Bohr-Sommerfeld condition for a circular atomic orbit. The 
angular momentum p^ satisfies 

<j> p d0 = n^i n$ integer 
i.e. 

2nrp = n^h 
where p is the linear momentum. From the de Broglie condition 

p=h/k 
and so the circumference of the circular orbit is 

2nr =n$k. 

The Bohr-Sommerfeld quantization rule is equivalent to demanding that 
the length of the circumference of a stationary Bohr orbit is equal to an 
integral number of de Broglie wave-lengths. This is additional evidence 
that particles have wavelike properties. 

2.6 The New Quantum Mechanics 

Werner Heisenberg 

In 1925 the young German physicist Werner Heisenberg introduced the 
new matrix mechanics. His work was based upon the inadequacies of the 
Bohr correspondence principle rather than the hypothesis of de Broglie 
waves. He found it necessary to introduce non-commutative algebra into 
physics. If q and p represent the position and conjugate momentum for 
a particle then Heisenberg found that in his atomic theory qp is not 
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equal to pq. In 1925 he applied his theory successfully to the simple 
harmonic oscillator and explained the phenomena of zero-point energy. 

Heisenberg found it necessary to introduce a new calculus. Max Born, 
a professor in the university of Gottingen where Heisenberg was working, 
pointed out that this 'new' calculus was precisely the calculus of matrices 
previously discovered by the English mathematician A. Cayley. Born and 
a colleague Jordan sought to extend Heisenberg's work to obtain a new 
theory of the atom. They obtained the important result 

pq- qp = -ffi, (2-52) 

where i is the square root of minus one. The significance of this will 
become apparent in a later chapter (Chapter 5). 

In January 1926 the Austrian Wolfgang Pauli successfully applied the 
Heisenberg theory to the hydrogen atom problem and in the same month 
the young Englishman Paul Dirac produced a more elegant, though more 
abstract, application of the Heisenberg theory to the hydrogen atom. 
He also obtained the relation between the Poisson brackets of classical 
mechanics and the commutators of quantum mechanics and showed that 
there is a very close relationship between classical and quantum 
mechanics. 

Erwin Schrbdinger 

At about the same time the young Austrian Erwin Schrodinger was 
developing his wave mechanics which seemed at the time to be com- 
pletely distanct from Heisenberg's work. He solved the hydrogen 
problem and published his results in the same year as Pauli and Dirac, 
1926. In February he gave the solution for a harmonic oscillator. Bohr 
had found it necessary to postulate the existence of quantum numbers; 
Schrodinger was able to derive them. 

Schrodinger knew that in many classical wave problems, discrete 
frequencies arise naturally when boundary conditions are imposed. 
Consider the vibrations of a string of length / with both ends fixed. 
This transverse displacement 4>{x, i) of an elastic string lying along the 
x-axis obeys the wave equation 

9 2 <J> J_^0_ n (2.53) 

37 2_ K p 2 9r 2 ~° 
where V p is the phase velocity of the wave and t is the time. Variable 
separable solutions that satisfy the boundary conditions 

0(0, r) = «£(/, = 

exist and have the form 

mix (nnVpt \ 
4>{x, t)=Asm—- sin I — j^~ + el, (2.54) 
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x=0 
Figure 2.10 Vibrating string. 



* = / 



where A, e are constants and n can take the integer values 1,2 The 

solutions (2.54) are called 'eigenfunctions' with corresponding discrete 
'eigenfrequencies' . 

v n =nV p lll 

Schrodinger was able to set up the wave equations governing the 
motion of a particle. His equation is obtained below although the argu- 
ment given is in no sense a proof. Wave mechanics cannot be derived 
from classical mechanics but must be postulated as in Chapter 3 of 
this text. The truth of the postulates depends upon the accuracy of the 
predicted results. 

From classical mechanics, the total energy of a particle of mass m in 
a conservative field is given by 



E=f-+V 
2m 



(2.55) 



where p is the particle momentum and V is the potential energy. The 
particle momentum can be written 



p = +y/2m{E - V). 
The phase velocity of the de Broglie matter waves is 

V -i- E 

p P y/2m(E- V)' 

By combining the wave equation (2.53) and this phase velocity 
expression the wave equation for matter can be written 



(2.56) 



V^-^^?=0 



dt* 



(2.57) 



where ^(x, y, z, t) denotes the amplitude of the matter wave at a point 
(x, y, z) at time t. 

States of constant energy correspond to states of constant frequency 
and for these the solutions can be written 



*(x, y, z, t) = \j/(x, y, z) exp [-2mvt]. 



(2.58) 
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V 2 * + 



2m[E-V] . , 



47rV* = 0. 



Using the Einstein relation E = hv then finally 



2m 



S7 2 <ff + V^=E^ or 



2m 



V 2 \p + Vty = E\l/. 



(2.59) 



This final equation is known as Schrodinger's time-independent wave 
equation and gives the states of constant energy. 

Linear box 

Consider the idealized problem of a particle restrained to lie in a linear 

box of length / by walls of infinite potential. Suppose 

V(x) = 0<x<l 

= oo otherwise. 

The Schrodinger equation is 

2m cbr 
The boundary conditions to impose are 

lK0) = tKO = 

as the particle must he within the box. The eigenfunctions (allowed 
solutions) are 

nirx 
\jj n (x)=Asin—— « = 1,2, ... 



V = od 



x = 
Figure 2 . 1 1 Linear box. 



v = 



V = oo 



x = l 
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where A is a constant and the eigenvalues are (allowed energy values) 

c n 2 fc 2 

E " = 8m^ « = 1.2,.... (2.60) 

The application of boundary conditions to the Schrodinger equation 
naturally results in discrete energy values and hence in quantum 
numbers. 

The time-dependent equation 

The time-independent equation (2.59) is sufficient for discussing states 
of constant energy but a more general equation is required for describing 
other states. It is clearly necessary to eliminate the energy £ For solutions 
of the form (2.58) 

** = + *lT (2-6D 

and combining this result with the time independent equation (2.59) the 
time-dependent equation 

h 2 -> 3^ 

_-V^ + ^ = ^— . (2 . 62) 

is obtained. 

The derivation of the Schrodinger equations (2.59) and (2.62) given 
above is not a proof but is given simply to indicate how they were first 
revealed. A formal derivation based upon the postulates of quantum 
mechanics is given in the next chapter. These equations are non-relativistic 
and correctly describe the motion of a particle only at speeds small com- 
pared with the velocity of light. Paul Dirac (1928) established the relavistic 
equation. 

The wave function 

The complex wave function *(x, y, z, i) is a solution of Schrodinger's 
equation and in quantum mechanics takes the place of the classical 
particle trajectory. ^ is some measure of the probability of finding the 
particle although this probability cannot simply be * itself since * can 
be negative. Max Born gave the correct interpretation in 1926. He said 
that 

**(x, y, z, tyi{x, y, z, f)8x8y8z = \V \ 2 8x8y8z (2.63) 

is the probability of finding the particle in the volume element 8x8y 8z 
about the point (x, y, z) at the time t, when a large number of position 
measurements are made on independent single-particle systems each of 
which is described by the wave function V(x, y, z, t). 
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As the total probability of finding the particle somewhere in space 
must be unity the wave function must be normalized to unity, i.e. 

j**(x, y, z, /)*(*, y,z,t)dt=l. (2.64) 

The normalized solutions to the linear box problem are obtained from 

(2.59) to be 

[2 nnx 
<Pn(x) = Jj sin— n = l,2 

The corresponding probability density 

2 , nnx 
p(pc) = n(x)ip„(x) = j sin 2 — 

and in the large quantum number limit «-»•«> the average value of p(x) 
over any interval 8x is 

From Bohr's correspondence principle this should agree with the 
probability density of a classical particle in the linear box. To establish 
this, consider a classical particle moving along the x-axis with constant 
speed v between reflecting walls at x = and x = I. The probability of 
finding the particle in the element 5 x is proportional to the time the 
particle spends in this element and so 

5x 
P(x)Sx = k— 

v 

where fc is a constant of proportionality. As 

jP(x)dx = l 
o 
then 

k=v/l 
and so 

P(x)=l/l 
in agreement with (2.65). 

The equivalence of wave and matrix mechanics 

The matrix mechanics of Heisenberg and Dirac was developed from the 
classical Poisson brackets of Hamiltonian mechanics whereas the wave 
mechanics of Schrodinger was developed from classical wave dynamics. 
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The first steps in showing the equivalence of the new matrix and wave 
mechanics were taken by Schrodinger himself. In 1926 he discovered 
that there is a very simple connection between Hamiltonian dynamics 
and wave mechanics. 

Consider the classical Hamiltonian H for a 'one-dimensional' particle 
of mass m moving in a conservative field. If E is the total energy then 

Pi 

2m 

where P x is the momentum and V(x) is the potential energy. Schrodinger 
suggested replacing the independent variable by itself and also replacing 
the momentum P x by the differential operator -ih d/dx. A quantum 
mechanical Hamiltonian 2ft is then defined by 

2m dx 2 J 

Note that Jf is an operator. The Schrodinger time-independent equation 
(2.59) can then be written 

Schrodinger also realized that the strange result 

p x x - xp x = -ih (2.52) 

inherent in Heisenberg's theory then followed as 

d d 

p x x = -ih - — x ——ih — ihx — = —ih + xp x . 
dx dx 

Later in the same year Dirac showed the complete equivalence of the 
matrix mechanics and the wave mechanics. They are both different 
aspects of the new quantum mechanics (see Chapter 6). 



The uncertainty principle 

In 1927 Heisenberg expounded his famous 'Uncertainty Principle'. In 
general terms this states that it is impossible to know precisely the exact 
position of a particle and its momentum simultaneously. When the 
position is measured the momentum is disturbed and, unlike classical 
mechanics, the disturbance cannot be allowed for. It can be shown that 
if Ax and Ap x represent the uncertainties in the x-coordinate and x- 
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component of linear momentum then 



AxAp 



•x 



This will be formally derived in Chapter 5 and is a consequence of 
Heisenberg's relation (2.52). A simple wave-packet 'explanation' will 
suffice here. Consider the de Broglie-wave packet (2.48) consisting of 
the superposition of two plane waves. The distance between the maxima 
of the modulated wave is 

Ax = 2tt/A& 



or 



AxAA: = 27r. 

In general, if a group of waves of spread Ax and mean wave number k 
is analysed into its constituent plane waves, then the wave numbers of 
the constituents will be found to be clustered about the mean value k in 
a range AA; such that 

AxAk~l. 
After de Broglie 

Ap x =hAk 
and so 

AxAp x ~ft. 
Ax is the uncertainty of position of the particle described by the group 
of waves and Ap x is the corresponding uncertainty in momentum. To 
specify the position exactly so that Ax = an infinite number of plane 
waves would be required and so Ak -* °°. On the other hand, to specify 
the momentum completely only one wave must be used and the wave 
packet will be of infinite extent so that Ax -»■ °°. 

Further developments 

The new theory developed very rapidly after 1926. In 1927 Dirac gave a 
quantum mechanical description of the electromagnetic field. Spin was 
introduced into quantum mechanics by Pauli and the English physicist 
C. G. Darwin and in 1928 Dirac gave a relativistic theory of quantum 
mechanics from which spin arose naturally. The reader is referred to 
Hoffmann's The Strange Story of the Quantum for an interesting non- 
mathematical account of the origins of Quantum Theory. 
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PROBLEMS 

1 Suppose radiation of wave-length X forms standing waves inside a 
cubical box of side length a with perfectly reflecting walls so that the 
walls can be taken as nodes. Explain why 

»,- = «/ 

X 

n 2~Z =am 



n 3 — =an 

3 2 

where «i, n 2 ,n 3 are positive integers and where /, m, n are the direction 
cosines of the radiation direction relative to a set of orthogonal axes 
coincident with three adjacent sides of the cube. Using the relation 

I 2 + m 2 +n 2 = 1 

show that the frequencies of the allowed modes are given by 

where c is the velocity of the radiation. 

Construct a set of orthogonal axes corresponding to the variables 
nic/2a, n 2 cl2a, n 3 c/2a. Each of the points in the cubic lattice in the 
positive octant corresponds to an allowed frequency mode. Deduce that 
the number of allowed frequencies in the range between v and v + hv is 

1 Aitv 2 „ 
8 lc_ 
\2a 

and that the total number of allowed modes per unit volume after 
consideration of polarization directions, is 

v 2 

877— r 8v 

c 

(This result can be used to obtain the equation (2.1).) 

2 Consider the Planck law (2.10) for the radiation emitted by a black- 
body radiator. Show that in the high frequency limit v -*■ °° that 

This is Wien's experimental law. 
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3 In free space the electric field flux out of a closed surface s is given by 

<k $ . n ds = q/e 
s 

where & is the electric field, n is the unit outward normal from the 
surface, q is the included charge and e is the permittivity of free space. 
The electrostatic potential <j> is defined to within a constant by the 
equation S = -grad 0. Show that cannot have a maximum or minimum 
value except at points where there is a positive or negative charge respect- 
ively. Hence deduce that a free charge cannot be in stable equilibrium at 
a point unoccupied by charge. This result implies that no stationary 
arrangement of charges can be in stable equilibrium under their own 
influence alone. 

4 Use the Bohr theory of the hydrogen atom to show that the radius 
of the first Bohr orbit is 



e 2 



°° ~ 2Rhc(4ire ) 
where R is the Rydberg constant and that the energy of this state is 
E = Rhc - 13-6 eV. 

5 Consider a free simple rotator composed of two point masses M dis- 
tance 2a apart. Show that if the rotator is free to rotate in a plane about 
the centre of mass, the total energy is 

E=Ma 2 co 2 

where to is the angular velocity. Use the Bohr-Sommerfeld quantization 
rule to show that a> can only take the values 

co = 



2Ma 2 



where / is a positive integer or zero. 

For a diatomic molecule a reasonable estimate for a is the radius of the 
first Bohr orbit (see problem 4). Show that the allowed energy levels of 
the rotating system are 



*-*M^r 



where R is the Rydberg constant and m is the electron mass. 
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6 Consider a plane 'matter' wave of frequency v and wave number k. 



V(x, i)=A exp 



— i2n [vt — —x 



Rewrite this equation substituting for v and k using the relations 
E = hv and p x =hk 

where E is the energy of the particle and p x is the momentum. Deduce 
that 

9* 
-rft-=p x * 

and 

9* 
ih— = E*. 
bt 

Hence show that the classical energy expression for a free particle 

2m 
can be expressed as the wave equation, 

~2m bx 2 bt ' 
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CHAPTER 3 



Quantum Mechanics I 



3.1 Introduction 

In the previous chapter the early attempts to account for the breakdown 
of classical mechanics were briefly outlined. The new theory was clearly 
not satisfactory and it was left to Schrodinger, Heisenberg and others to 
develop the quantum mechanics. Historically quantum mechanics was 
evolved using intuition and analogy with classical mechanics. Even so, 
there is a basic difference in philosophy between them, and quantum 
mechanics can certainly not be derived from classical mechanics. 

For a system obeying classical mechanics, (this is an approximation, 
although it is very often a very accurate one), if all the positions and 
velocities of the particles are known at some instant then the state of the 
system at some later time is completely determined by Newton's laws. 
This is not the case for a system obeying quantum mechanics. In fact the 
position and momentum of a quantum particle can not both be known 
exactly at the same instant and even the concept of a particle trajectory 
loses its meaning. There is a limit to what can be known about the state 
of a quantum system. In quantum mechanics the physical quantities such 
as energies, momenta etc., which can in principle be directly measured 
are called observables but not all the observables of a system can be 
measured simultaneously. If some observable is measured, this act of 
measurement may disturb the system and change the value of some other 
observable. This is true for both classical and quantum systems, but there 
is a basic difference in outlook. The disturbance in a classical system 
produced by the act of measurement can in principle be allowed for 
exactly but not in quantum systems. The most complete description 
possible of the state of a system in quantum mechanics is less detailed 
than that in classical mechanics. Because of this, the behaviour of the 
quantum system at some future time is not uniquely determined. 
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Classical mechanics is based on Newton's three laws. These cannot be 
proved directly but are verified by the experimental results obtained 
from 'classical systems'. Similarly, quantum mechanics is based on 
postulates. At first the postulates seem strange, but their validity is 
confirmed by the agreement of the theory with experiment. This postu- 
late statement of quantum mechanics is the most effective introduction 
to the subject and this is given below. 



3.2 The Postulates 



Postulate 1 

Any state of a system with n degrees of freedom is described as completely 

as possible by a wave-function *(qi q„,t) which depends upon the 

co-ordinates q t and the time. * is in general complex and as it has to be 
physically meaningful it is single-valued. The wave-function may be 
multiplied by an arbitrary complex number without any essential change 
in its physical significance. It will be shown that * and its derivative are 
continuous except at a certain number of points and in general it is also 
quadratically integrable, i.e. 

[■qr*yj/ dr = k 2 A; finite and real (3.1a) 

where ** is the complex conjugate of * and the field of integration is 
over all values of the co-ordinates. As the wave-function can be multi- 
plied by a constant without changing its character, it is often convenient 
to describe the state by 

This new wave-function is 'normalized' in the sense 

J*t*,dT = l (3.1b) 

and once normalized remains so for all time. ^ remains normalized if it 
is multiplied by any complex number of modulus unity. 

Definition 1. Linear operator 

An operator transforms one function into another. A well-known 
example is the differential operator D = d/dx, e.g. 

Dx 3 = 3x 2 . 

An operator a is said to be linear if 

aQ&i + * 2 ) = a *i + °&2 and a(aty)=aaty (3.2) 
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where * ls * 2 are arbitrary functions and a is an arbitrary constant. The 
operator D is linear. 

A simple example of a non-linear operator is the square operator which 
will be denoted by S, i.e. 

then 



Definition 2. Hermitian operator 

Suppose *j and * 2 are arbitrary quadratically integrable functions. The 
linear operator a is said to be Hermitian if 



J*?(a* 2 ) dr = J* 2 («*i)* dr 



(3.3) 



the integration being taken over all values of the co-ordinates. Clearly a 
linear combination of Hermitian operators a, is itself a Hermitian 
operator and any power of a Hermitian operator a is Hermitian, e.g. 

JV*(a + /3)* 2 dr = j"**a# 2 dr + jv*p*2 dr 

= f * 2 (a*i)* dr + f * 2 (0*i)* dr = f * 2 [(a + /3)*i] * dr. 

(3.4) 

Postulate 2 

With every physical observable there is associated a linear operator. This 
linear operator is Hermitian. 

Of course it is essential to determine the precise form of the operator 
a which represents the observable A. It is at this point that analogy with 
classical mechanics is important. Suppose a and are two quantum 
operators representing variables A and B. The commutator of these two 
operators is defined to be 

[a,P]=(ofi-Pa). (3.5) 

[Note; the r.h.s. of (3.5) is not in general zero. Suppose a = x and = d/dx 
then 



dx 



dx dx 



d 


d 1 




x — - 1 - 


-x — 


= -1. 


dx 


dx 





Commutators have many of the properties of Poisson brackets, e.g. 
[a,p]=-\p,a], [a,o]so. 

The quantum operators a, are such that their commutator is propor- 
tional to the classical Poisson bracket of A and B. The constant of 
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proportionality is a universal constant (with the dimensions of 'action', 

i.e. Joule-sec.) and introduces the Planck constant h into quantum 

mechanics. 

i_ 

[a,&\=m{A,B] *=— i = y/-i. (3.6) 

(If there are any variables remaining on expansion of the Poisson bracket 
they are to be replaced by operators.) 

The form of the operator is not uniquely determined by (3.6) but 
depends on the 'representation' used. In the commonly used Schrodinger 
representation any position co-ordinate q t or the time t is represented by 
q t or t respectively. The momentum p,- conjugate to q t is represented by 
the operator 

1 JL 

It is easy to verify that the commutator of the operators representing 
q t and p/ is 



n a 



¥=j»6 v * (3.7) 



and by comparison with (1.30) it is seen that (3.6) is indeed satisfied. It 
is obvious that the operator q t associated with the co-ordinate q t is 
Hermitian. Similarly it is not difficult to show that the operator 
representing the momentum p/ is Hermitian. On integrating by parts 

and if the wave function vanishes at infinity the first term on the r.h.s. is 
zero and the result is proved. Any classical observable (such as energy), 
which is a function of the co-ordinates, momenta and time is represented by 
an operator obtained by substituting 

<7/-*-«< Pi-*Tr — t-*t. (3.8) 

i dqt 

For example, any function of p,-, /(p,) is represented by 

and similarly for /fe). Any functions of conjugate variables must be 
carefully ordered before converting into an operator. This is necessary 
because the operators representing the variables (<7;,P/) do not commute. 
In classical mechanics it is meaningless to distinguish between p { q t and 
q t Pi but it is essential to do so in quantum mechanics. The exact order of 
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the factors in a term can often be decided by remembering the condition 
that the operator be Hermitian. Ambiguity can sometimes be removed by 
taking the mean of two possible orders of the factors. The classical pro- 
duct q t pi (or Pi q() can be represented by the Hermitian operator 



ti 3 ft 9 

i dq, i dq t 



It may be that the exact order can only be decided by trial and com- 
parison with experiment. The formulation in terms of rectangular position 
and momentum co-ordinates is least likely to lead to ambiguity. If necess- 
ary the operator can then be transformed to some other generalized co- 
ordinate system. 

For example, consider the Hamiltonian for a particle of mass m moving 
in a conservative field of force. The classical Hamiltonian is 

H = — (p 2 +p 2 +p 2 )+V(x,y,z) 
2m 

and the corresponding quantum mechanical operator is 

n 2 1 a 2 a 2 a 2 \ 
x= -M^a7 + ^) +nw) - 

There are other representations apart from the Schrodinger represen- 
tation and in these the operators will take a different form. Suppose 
instead of (3.8) the representative operators are taken as 

ft"*- -7 T- Pi^Pi t + L (3.9) 

i aq t 

The relation (3.6) is still satisfied in this 'momentum representation', and 
the quantum mechanical Hamiltonian corresponding to the example 
mentioned above is now 

The form of the potential function may make this representation difficult 
to use and unless otherwise stated the Schrodinger representation will be 
used throughout this book. 



Postulate 3 

In classical mechanics, Hamilton's equations determine the time variation 
of a state. Similarly, the Hamiltonian operator for the system determines 



QUANTUM MECHANICS I 59 

the time variation of a quantum mechanical state in that the state wave 
function * must satisfy Schrodinger's time-dependent equation 

jr*«*^. (3-10) 

of 

If the state function is known at some initial time, the equation (3.10) 
determines * at any other time. 

Equation (3.10) is a second order partial differential equation defined 
for all points where the potential energy is finite. As the second derivatives 
exist at all points (where Fis finite) then the first derivatives must be 
continuous and so * must be differentiable and continuous. These 
restrictions on the state functions were mentioned in Postulate one. 

Definition 3. Eigenvalues and eigenjunctions 

If $ is a function and a is an operator such that 

o$=a$ (3.11) 

where a is a constant then $ is an eigenfunction of a and a is the corre- 
sponding eigenvalue. The totality of all eigenvalues for an operator is 
called the spectrum of the operator and this may be discrete, continuous 
or partly discrete and partly continuous. The number of eigenvalues of a 
quantum operator is in general infinite. 
For example, the second-order equation 

— 7 +aX = 
dx 2 

has solutions X = A sin y/ax +B cos\/ax where A, B are arbitrary. So, 
when no boundary conditions are imposed the spectrum of the operator 
d 2 /dje 2 is continuous. On the other hand if the boundary conditions, 

X(0) = X(L) = 0, 

are imposed, the spectrum is discrete and the eigenvalues are 

nit^ 2 



"n= \-r) n = l,2, 



with corresponding eigenfunctions 

nn 
Xn{ x ) = -4 sm — x. 



Definition 4. Orthogonality 

Functions forming a set are mutually orthogonal if, for all pairs, 

f$> m dT = n*m, (3.12a) 
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the integration being over all the values of the co-ordinates. If in addition 
the functions are normalized to unity (3.1b) then 

J<J>** m dT = 6„ m . (3.12b) 

The functions are then said to form an orthonormal set. 
A well-known example of orthogonal functions is the set 

$„(x) = — — : sin mix 
the members of which are orthogonal over the range < x < 2-n, i.e. 

J ®n$m d* = 5„ m . 


Theorem 1. Reality of eigenvalues 

The eigenvalues of a Hermitian operator are real. 

Proof 

Let the Hermitian operator be a and $„ an eigenfunction belonging to 
the eigenvalue a n , i.e. 

a$„=a„<l>„ (i) 
J«I>*a*„ dr=a n J $**„ dr. 
Take the complex conjugate of (i) 

:. j (*$„)**„ dT = a* j*fa„dT. 
As the operator is Hermitian 

a„ =a* 
and the eigenvalue a„ is real. 

Theorem 2. Orthogonality of eigen functions 

Any two eigenfunctions of a Hermitian operator, belonging to different 
eigenvalues are orthogonal. 

Proof 

Let the Hermitian operator be a and the eigenfunctions be $„, $ m with 
corresponding eigenvalues a„,a m , i.e. 

<*<!>„ =a„<&„ (i) a<I> m =a m $ m (ii) 
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From (i) 

J$*a«K„dT=a„ j 4>* <!>„ dr. 

Since a is Hermitian 

J" $*<*<!>„ dT = ]"<!>„ (a<i> m )* dT 

= J**^*™ dr (from 00) 

as a m is real. 

(fln-«m)f*«*«d»' ss 
andasfl„ #a m then 

f$^$„dT = m^w. 

Postulate 4 

Suppose the wave function *(q u . . ., <7„ , describing a state is an eigen- 

function of the operator a representing the observable A. i.e. 

Then in this state an exact measurement of the observable will yield 
precisely the constant real value a. (As a is Hermitian a is real.) 

The time-independent Schrbdinger equation 

When the potential energy of the system is not a function of time, the 
time-independent Schrodinger equation (3.10) has variable separable 
solutions of the form 

*(«i, • • -><?mO = <K<7i «n)T(t). (3-13) 

(Note, the Greek \jj is used to designate the spatial part of the state 
function *.) Substitute (3.13) into (3.10) and divide through by * 

i^^dT (3.14) 

E is a separation constant independent of q t and t, i.e. 

*, = *,«"¥*/' (315) 

with 

is a solution of (3.10). 
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Equation (3.16) is the time-independent Schrodinger equation. \j/f 
(also */) is an eigenfunction of Jt?; Ef is the corresponding eigenvalue 
and is the energy of the state described by ^7. Such states in which the 
energy has a definite value are called stationary states of the system. 

Consider a free particle of mass m moving along the jc-axis with no 
force acting. The classical Hamiltonian isp 2 /2m and the Schrodinger 
equation governing the motion is 

ft 2 3 2 * 9* 

= m—. (3.17) 



2m dx 2 dt 

The spatial parts \p of the variable separable solutions satisfy 

2m dx 2 
and so 

*(x)=Ae ikx +Be-' kx 
with 

2m 

The boundary conditions would determine the arbitrary constants and 
also the allowed energy values. The complete variable separable solution is 

F F 

¥(x, i) =AA kx -*<)+ Be'*"* + n <\ (3.18) 

Observe that the two separate solutions are also eigenfunctions of the 
momentum operator (fi/i) (d/djc) with eigenvalues ±tik and represent 
uniform motion in the positive and negative directions respectively. 

Definition 5. Linear independence 

A set of functions {<£,- } is said to be linearly independent if the relation 

a\4>i +a 2 <p2 +• • ■ + «n0« = (3-19) 

implies that the constants a,- = for all 1. This means that none of these 
functions can be expressed in terms of the others. 

Theorem 3 

Any set of mutually orthogonal functions is linearly independent. 

Proof 

Let the orthogonal set of functions be {<&•} and suppose that (3.19) is 
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satisfied. Multiply equation (3.19) by ft* and integrate over all values of 
the co-ordinates. 

2 a t f ft*ft dr + a t fft*ft dr = 0. 

As the functions are orthogonal then the first term vanishes and so 
a/ j"ft*ftdT = 0. 

Clearly this integral is non-zero and so a/ = 0. This is true for all/ and so 
the orthogonal functions are linearly independent. 

Conversely, if a set of non-orthogonal functions are linearly indepen- 
dent and are also quadratically integrable, it is always possible to form a 
mutually orthogonal set from them by a suitable linear transformation. 
This is the Schmidt method. Suppose {0,- } is the non-orthogonal set, 
then an orthogonal set {ft} can be defined as follows. 

Let 0i = 1, and then choose a with ft* = 6 2 + <*0i so that ft, is 
orthogonal to ftj , i.e. 

a = - j<tf0 2 dT/jft>idT. 

Next choose 0, , 2 with ft, = *3 + Mi + fc*2 so that *3 is ° rth °g° nal 
to <t>i and 02 etc - 

Definition 6. Function space 

The function space defined by the linearly independent set of functions 

{0,} includes all functions of the form 

b i e l +b 2 d 2 + ... + b n e n ( 3 - 2 °) 

where Z>,'s are any complex constants. The number of linearly indepen- 
dent functions used to define this space, gives the dimensionality. Clearly 
the orthogonal set of functions {ft} obtained from {0,} by the Schmidt 
process define the same space. Either of the basis sets {ft} or {»,} are 
said to span the space in that any function in the space can be expressed 
as a linear combination of the members of either set. This definition is in 
accordance with the more usual ideas of Cartesian space. The unit vectors 
ii.h.U are orthogonal vectors and are also linearly independent. 
Together they define a three-dimensional space which includes all vectors 
of the form 

r = cJi +c 2 h + c 3 h- 

The space defined by the functions {jq} is said to be a proper sub- 
space of the function space {0,} if every function in the sub-space 
belongs to the space {0,} and if the converse^ is not true. 

The plane defined by the unit vectore h, l 2 is a proper sub-space of 
the three-dimensional space given by li , 1 2 , 1 3 ■ All the vectors in the 
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sub-space are of the type 

r = c 1 l 1 + c 2 l 2 , 

and obviously belong to the full Cartesian space. Alternatively, any 
vector including 1 3 while belonging to the three-dimensional space does 
not belong to the sub-space. 



Definition 7. Degeneracy 

There may be several eigenfunctions of an operator belonging to the 
same eigenvalue. If there are n linearly independent eigenfunctions 
corresponding to a given eigenvalue, the eigenvalue is said to be w-fold 
degenerate. These eigenfunctions can always be chosen to be orthogonal 
by the Schmidt process. 



Postulate 5. Expectation value of an operator 

Suppose there are a large number of identical systems each described by 
the same normalized state function, *. If ^ is an eigenfunction of the 
operator a representing the observable A then an exact measurement of 
A will yield the corresponding eigenvalue. In the more general case the 
state function is not an eigenfunction of a. It is now postulated that an 
exact measurement of A must yield one of the eigenvalues. In general, 
different values will be obtained for each system and the average value 
(sometimes called the expectation value), of the results is 

(A) = j<P*a*dT. (3.21) 

Probability density 

From (3.21), the expectation value of the co-ordinate q t is 

<<?,•> = J"**<fr*dT 

if ^ is normalized. This implies that *** dr represents the probability 
that the co-ordinates lie between q t and q t + dq t . That is, "9*^/ is a 
probability density. 

For a single particle system, the wave function depends upon the 
co-ordinates of the particle x, y, z and ¥** represents the probability 
density of the particle. It is no longer possible to state with certainty 
the position of the particle. 

Postulate 6. Completeness of eigenfunction sets 

Postulate 1 says that every state of a system is described as completely as 
possible by a wave function. At any instant, all such state functions 
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belong to some function space which is characteristic of the system. 
(This space is an abstract Hilbert space. The reader is referred to Quantum 
mechanics, by G. L. Trigg.) As the Hamiltonian operator has an infinite 
number of linearly independent eigenfunctions each of which can 
represent a state of the system, then it is clear that the function space is 
of infinite dimension. 

Suppose */ is a state function of the system and is also an eigen- 
function for the operator representing some observable of the system. 
It is now postulated that the infinite set of all such orthogonal eigen- 
functions span the system function space. That is, the set of eigen- 
functions is complete in that any state function can be expressed as a 
linear combination of them. For the moment it will be assumed that the 
operator spectrum is discrete and 

* = 2c/*,. ( 3 - 22 ) 

i 

If the eigenfunctions are quadratically integrable and are normalized 
the expansion co-efficients 

C/ = J^*-^dT. (3.23) 

Equation (3.22) shows that any state can be expressed as a superposition 
of other states. 

The reader will no doubt have met the concept of a complete set of 
functions in the theory of Fourier series. The functions \, sin nx, cos nx, 
n - 1 , 2, . . . form a complete set in that any piecewise continuous 
function f(x) with period 2ff can be expressed as a linear combination 
of them, i.e. 



f(x) = — + 2 (a n cos nx + b„ sin nx) 

2 i 



with 



a„ = - I f(x) cos nx dx and b„ = - f(x) sin nx dx 

■n J ff J 



Theorem 4 

From the previous postulate, any state function * can be expressed as a 
linear combination of the orthonormal eigenfunctions {*/} of the 
operator a representing the observable A (3.22). The corresponding eigen- 
values are a,-. An exact measurement of A must yield one of these eigen- 
values and the probability of observing a t is 

P,=cTc t =\c,\ 2 . (3-24) 
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Proof 

Expand * as a linear combination of the orthonormal eigenfunctions 
{*/}. From (3.21) the expectation value of A is 

(A > = f 2c**/*« 2cj*j dr = 22ctc )aj f *,**, dr = 2 \c, \\. 

i j if J i 

So I c ( | 2 can be taken as the probability of observing the eigenvalue a t . 
Note that if * is normalized 

f***dT = 2|c/| 2 = l. (3.25) 

J t 

If the value a t is obtained on measuring A then assuming the eigen- 
value is non-degenerate, the act of measurement must have disturbed the 
system changing the state function from * to the eigenfunction *,-. In 
general this function will change with time but if the observable A is 
immediately measured again the value a f will once more be obtained. 

If the eigenvalue is degenerate, (3.24) must be summed over the 
degenerate eigenfunctions. 

Continuous spectrum 

The equations (3.22-3.25) assume the operator has discrete eigenvalues. 
There are operators however which have eigenvalues running continuously 
over all or part of the spectrum. Suppose a is an operator for which the 
eigenvalues are continuous. (An example is the position operator.) The 
generalization of (3.22) for an arbitrary wave-function * is 

*=J"c(«)* a d« (3.26) 

where the integration is over all permissible values for the eigenvalue a. 
The problem of finding the expansion coefficients is complicated by 
the fact that the eigenfunctions * a are not quadratically integrable. If 
the wave-function * is quadratically integrable then from (3.26) 

jV** dr = J[Jc*(fl)** da]* dr = jc*(a)[j*%V dr] da 

assuming the order of integration can be reversed. It would seem reason- 
able that \c(a)\ 2 da should represent the probability that the observable 
has a value between a and a + da and as the sum of all probabilities is 
unity (cf. (3.25)) 

j\c(a)\ 2 da=l. (3.28) 

If * is normalized to unity 

Jc*(a)[J*** dr] da = \ c*(a)c(a) da 

c(«)=J"***dr. (3.29) 

Compare this result with (3.23) for a discrete spectrum. 
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The reader is asked to compare (3.26) and (3.29) with a function 
f(x) satisfying /"«, l/(x)l 2 dx finite, and its Fourier transform g(y). 

oo 

JTn J 
with 

It is now shown that the eigenfunctions of an operator with a con- 
tinuous spectrum are not quadratically integrable. From (3.26) and (3.29) 

c(«)=j"**[fc(a> a 'dfl' dr. (3-30) 

Reversing the order of integration 

c(a) = jc(a')[j**1V<lr da. (3-31) 

(Strictly this step is not permissible since the inner integral in (3.31) is 
divergent.) For this result to hold true for arbitrary wave-functions then 
the value of c(a) cannot depend upon the value of c(a'\ a' # a, i.e. 

J*** fl 'dT = a'* a (3-32) 

In addition, to satisfy (3.30) and (3.31) 

J[J*X'dr]d«'=l. (3.33) 

These results imply 

J"*** a dT = °°. (3-34) 

The 'improper' function with these properties is the Dirac delta function 
6(x) 'defined' by 

5(x) = x¥=0 

8(0) = °° 

Js(x)dx = l. (3.35) 

Clearly the delta function is not analytic in the usual sense and a 
more formal definition may be given as the limit of a sequence of 
analytical functions satisfying (3.35). There is not a unique sequence 
with this property but a useful one is 

6(je)=lim ElLp (3.36) 

e-K*> TtX C 
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Figure 3.1 Delta function sequence. 

The reader is asked to confirm that (3.35) is satisfied. Note that 

oe 

sin 2 j3 



I 



d/3 = 7T. 



Strictly the delta function is only meaningful under an integral sign and 
the limiting process taken after the integral is evaluated. 
From (3.32), (3.33), (3.34), (3.35), 



JV*>*Vdr = 5(a'-a). 



(3.37) 



Equation (3.37) is the generalization of (3.12b). The eigenfunctions 
for different eigenvalues are still orthogonal but they are not quadratically 
integrable and cannot be normalized to unity. 

The corresponding result in Fourier transform theory is 



— f e-" y ef xy ' dx = 8(y-y'). 
2n J 



PROBLEMS 

1 Classically, the angular momentum of a particle with respect to the 
origin is L = r A p where p is the linear momentum. Show that in the 
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Schrodinger representation the x-component of the angular momentum 
operator is 



Confirm that the commutator 

2 The classical Hamiltonian for an electron (charge -e) in an electro- 
magnetic field is 

tf = ^(p + eA) 3 -e*. 0-45) 

When expanded this becomes 

1 2 

H = — + — v.A+^—A 2 -e<j>. (a) 
2m m 2m 

Alternatively it could be written 

2 2 

# = _P_ + -i_ (p . A + A .p) + -i- y 42_ e0 . (b) 
2m 2m KV 2m 

Write down the corresponding operators using the substitutions {3.8) and 
show that (b) produces a Hermitian operator whereas (a) does not. The 
operator obtained from (b) is the correct quantum-mechanical Hamiltonian. 

3 The classical Hamiltonian for a linear harmonic oscillator is 

H = — p 2 + — moP-x 2 . 
2m 2 

Derive the Hamiltonian operator in both the Schrodinger and momen- 
tum representations and write down the corresponding time-independent 
Schrodinger equations. Explain why the eigenfunctions of these two 
equations are identical in form. 

4 Schrodinger's time-independent equation for a particle moving along 
the x-axis in a conservative field is 

fi 2 d 2 t// 
2w dx 
where V(x) is the potential energy and E is the energy. Suppose there is 
a potential well defined by 

V=-W -t<x<t 
V = \x\>t. 
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Integrate the Schrodinger's equation across this well. Show that if 
V -*■ —°° as t ->■ in such a manner that 2 Vt = - k (finite) then 



#\ _ /dj/A = 2m 
dxL \dxji 



2 W 



\dxj r 



ldx/ r \dx/i 



are the right hand and left hand derivatives at the origin. This result 
illustrates that the derivative of the wave-function is not continuous at a 
singularity in the potential. 

5 Consider the functions 

\\z 1 {x) = e~ x , \jj 2 (x)=e- 2x , \p 3 (x)=e- 3x , etc. 

defined in the interval < x < <*>. Show that these functions are quad- 
ratically integrable in this interval and normalize them. Using the Schmidt 
method obtain an orthonormal set of functions. 

6 Show that the function space (x 3 , y 3 , x 3 +y 3 ) has the dimension two 
and not three. 

Similarly show that the functions space (xy 2 ,yz, z 2 x, x(y 2 - z 2 )) has 
dimension three and not four. 

7 A periodic function with period 2ir is defined by 

f(x)=x —TT<X<1T 

Show that the associated Fourier series is 

2 (-1)" ' -sinnx. 

n=\ n 

8 Suppose {0,(x)} is a finite set of n real orthonormal functions in the 
interval (a, ft), i.e. 

b 

j<ktfydx = 6, / . 

a 

Let f(x) be some real function and define 'Fourier constants' c f by 
b 
Ci = J f(x)<t>i (x)dx i=l,...,n. 
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Consider the integral 

b 



J=\[f(x)-L n (x)] 2 dx 



where L n (x) is some linear combination of the orthonormal functions 
and is written 

L„(x)= 2jMx). 
i = l 

Show that the values of the constants ?,- that make / least are 

It = ti- 
lt may be assumed that the integral 

b 

j f\x) Ax 

a 

exists. 

Explain why 



c?+d+...+c*< J/ 2 (x)dx 



This result shows that the sum of the squares of the Fourier coefficients 
converges. 

9 The general form for a second order, linear homogeneous equation is 

(\ 2 Y AX 

ax ax 

where X is a constant. Show that by multiplying through by the integrat- 
ing factor 

1 Jt «* 



this equation can be put into the self -adjoint form 
+ [q(x) + Xp(x)]X = 0. (a) 



_d_ 
ax 



aX 

r(x) aT 



10 Suppose X t and X2 are solutions of the self-adjoint equation 
(equation (a) in question 9) corresponding to different 'eigenvalues' 
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Xi and X 2 an d that both solutions satisfy the same boundary conditions 

a^X + a 2 — = atx=a 

dx 

dX 
b 1 X + b 2 — = aix=b. 
dx 

By multiplying the equation satisfied by X x through by X 2 and that 
satisfied by X 2 through by X x integrating from x = a to x = b and 
subtracting the two, show that provided Xj =£ X 2 

b 

f p(x)X 1 X 2 dx = 0. 

a 

That is, the two solutions are orthogonal with respect to the weight 
function p(x) over the range (a, b). 



1 1 Legendre's differential equation is 
(l-x 2 )~ -2x~ + lQ+l) = 0. 



Put this equation into self-adjoint form (question 9), and verify that 
r(x) = 1 - x 2 . Observe that r(l) = r(- 1) = 0. When / = n (integer) a 
solution of the equation finite at x = ± 1 is the Legendre polynomial 
P„(x). Following a method similar to that used in question 10 show that 
+ i 

f P n (x)P m (x) dx = n =£ m. 
-l 
Note that no boundary conditions need be imposed upon the solutions 
in this case. 

12 The Dirac delta function has the property 

oo 

j"5(x)d* = l. 

Suppose 5 '(x) is defined as the 'derivative' of 5 (x). Show that this 
derivative has the property 

oo 

$ 8'(x)F(x) dx = -F'(0). 

— oo 

13 Using equation (3.29) substitue for c(a) in (3.26). Hence derive the 
'closure relation' 

J"* a V)* a (<7) da = S(q[ - q 1 )8(q' 2 - q 2 ) . . . 8(q'„ - q n ). 
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14 In the Schrodinger representation, the operator representing the 
co-ordinate q is itself. The corresponding eigenvalue equation is 

q denotes the actual value of q. (Of course q has a continuous spectrum.) 
Explain why the eigenfunctions are of the form 

*«' = 5(<7 -q')- 
Using (3.26) expand an arbitrary wave function <f(q) in terms of the 
co-ordinate eigenfunctions. Show that the expansion coefficient 

c(q') = *(q). 
Deduce the probability that the value of the co-ordinate lies between q 
and q' + dq' is 

\*(q')\ 2 dq' 
(•$*■$ has already been shown to be the probability density.) 

15 The quantum mechanical expectation value for the kinetic energy 
of a particle of mass m is 



2m J 



2m J 

where \jj is the state function and the integral is over all space. Substitute 
for the Laplacian operator using the identity 

div (A grad B) = A V 2 B + grad A . grad B 

where A , B are scalar -functions. 

If the wave function falls off faster than r' 1 confirm that < T) is real 
and positive. Deduce that the allowed energies of a particle satisfy 

EXV) 

where V is the potential energy. 

(Wave functions often fall off as e~ ar . Hint: use the divergence 
theorem 



diva dr = 9a. dS\. 
s I 
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CHAPTER 4 



Schrodinger's Equation 



4. 1 The Classical Limit 

The fundamental equation of classical mechanics is the Hamilton-Jacobi 
equation, 

/ 35 35 \ 35 ., _„ 

"("•■■■•"•^•■•■•a^'r-*- (L62) 

For a conservative system, the Hamiltonian is time-independent and the 
equation reduces to 

«{« •-£ £)-* (,63) 

where E is the constant energy of the system. 

There is an obvious superficial relationship between these equations 
and the basic equations of quantum mechanics. The time-independent 
Schrodinger equation which every state function must satisfy is 

^...,«„, 7 ^,..., 7 ^,r)* = - 7 -. (3.10) 

When the Hamiltonian does not depend on the time there are stationary 
states for which the energy is defined and they are obtained by solving 
the time-independent Schrodinger equation 
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Quantum mechanics includes classical mechanics as a limiting case and 
the Hamilton-Jacobi equation is valid when ft is small relative to other 
quantities with the same dimensions (i.e. as h -* 0). 

A similar situation occurs in electrodynamics where wave optics 
includes geometrical optics as the limiting case when the wave-length is 
small compared to the dimensions of the physical problem (i.e. as X -> 0). 
In a dielectric medium with zero charge density and zero current density, 
the components of the electric field E and the magnetic induction B 
satisfy the wave equation 

'■'-;?g-o (41) 

where u{x, y, z) is the velocity of propagation of the wave. When the 
velocity is a constant there are plane wave solutions of the type 

y =5e i(k.r-wf + oO (4.2) 

The direction of propagation is normal to the wave surface that is the 
locus of points which have the same phase at a given time. In the more 
general case there are solutions of the form 

f=B(x,y,z,t)e'*< x '>'' z ' t) (4-3) 

where the phase (eikonal) $ does not have the simple form of (4.2). In 
the limiting case when $ changes by large amounts in small distances 
(corresponds to X -* 0) substitution of (4.3) in (4.1) shows that the phase 
must satisfy the eikonal equation 

a*\ 2 +(— V+l— 1 =-U^\ . (4.4) 



9£\ 2 = J_/a*\ 2 

9z / [H 2 ]\ 3f ] 



This is the fundamental equation of geometrical optics. For waves of a 
single frequency co 

<P(x, y, z, f) = <t>(x, y, z) - ut (4.5) 

and the eikonal equation is 

'^Y + l^\\[21\ = ±L (4.6) 



dx] \dy 






This equation is analogous to the Hamilton-Jacobi equation for a particle 
of mass m in a conservative field, 

In geometrical optics the paths of rays are specified by Fermat's 
principle. One statement of this is that the ray-path is such that the dif- 
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ference between its phases at the fixed end points takes a stationary 
value. In classical mechanics the motion of a particle is governed by 
Hamilton's principle. All this suggests a correspondence between the 
phase of a wave and Hamilton's characteristic function S. 

The wave function for a single quantum particle satisfies Schrodinger's 
equation 

h 2 h 9* 

_- 5-V 2 * + K* = --— . (4.8) 

2m i dt 

In analogy to (4.3) try a solution of the form 

*(x, y, z, i) = B(x, y, z, t) e m *> y - z ' f) (4.9) 

and write 

— J fr»*'> . (4.10) 

h 

The constant h is introduced as the phase is dimensionless whereas S and 
h have dimensions of action e.g. Joule sees. Schrodinger's equation 
becomes 

- ^- V 2 B - — grad B. grad S -^-BV 2 S + ^- (grad S) 2 + VB 
2m m 2m 2m 

-£'♦*£ (4I1 » 

In the limit of small fi, this equation becomes 



bS\ 2 (MY + (dS\ 
dx) + \dy] + \dz) 



2 dS 

+ V2m = -2m—. (4.12) 

at 



In the same approximation, the time-independent Schrodinger equation 
becomes 



~:J + [dy) + [bz 



M|%(^\(^ 2 =2m( ^ F) (4,3) 



which is the Hamilton-Jacobi equation (4.7). In the limit that terms in fi 
can be ignored the wave function changes according to classical rules. 
From equation (3.6), if fi can be replaced by zero then all commutators 
are zero. This means that quantum-mechanical operators may be replaced 
by classical variables. 

The brief treatment above has indicated how quantum mechanics 
includes classical mechanics. Historically the approach was quite the 
reverse. Schrodinger argued that if geometrical optics required extension 
to include wave optical effects such as diffraction, then perhaps in 
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analogy, classical mechanics could also be extended. This lead to his 

theory of wave mechanics. 
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4.2 One-Dimensional Problems 

In these idealized problems a single particle is confined to move in one 
dimension, along the x-axis, say. They are worth consideration since they 
bring out the essential elements of a quantum mechanical solution using 
only elementary mathematical techniques. In fact only a few real quan- 
tum mechanical problems have a rigorous analytic solution. 



Potential barrier of infinite width 

A simple example of this type of problem is a potential step function. 
The potential energy of the particle is defined by 



V(x)=0 x<0 

= V x>0. 



(4.14) 



It is desired to find the eigenfunction solutions of the time-independent 
Schrodinger equation 



h 2 d 2 J/ 
2m dx 



(4.15) 



Incident 
particles 



\ / 

\ / 



V 



Potential 

Real part of 
<^(x) when 
E< V 



Figure 4.1 Step function potential. 
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To the left of the origin the potential energy is zero and there are no 
forces acting. The particles are free and the eigenfunctions are 

\jj(x) = Ae ik > x + Be~ ik ' x x<0 (4.16a) 

with 

E = h 2 k\l2m (4.16b) 

where A, B are constants. 
To the right of the origin 

-— —i +(V o -E)^ = x>0. 
2m ax 

The eigenfunctions are 

ip (x) = Ce ik * x + De~ ik > x x>0 (4.1 7a) 

with 

(E-V ) = h 2 kll2m. (4.17b) 

C, D are also constants. 

Solutions of this problem exist only for E > 0. Then the energy 
spectrum is continuous and the eigenfunctions are not quadratically 
integrable although i// can still be normalized over a finite volume. If a 
flux of non-interacting particles flows to the right from minus infinity 
\\)*\\j can be chosen to represent the particle density. Some particles 
may be reflected by the potential barrier and some transmitted The 
terms Ae ik ^ x and Be~ ik ^ x in (4.16a) respectively represent the incident 
and reflected particles. There are two cases to consider. 

(a) When E > V then classically all particles would be transmitted 
through the barrier. This is no longer true and \B/A\ 2 gives the intensity 
of the reflected wave relative to that of the incident wave. In this case k 2 
is real and the term e~' k ^ x (4.17a) represents a wave travelling to the left. 
Clearly for x > no particles can flow to the left and D must be zero. 

As the potential is finite \p, di///dx are continuous at the origin, i.e. 



(4.18) 



and 



A + 


B = C 

- B)k x = 
ki — k 2 
k\ "** k 2 


Ck 2 . 

y/E- 






B 


-y/E- 


-V 


A 


y/E 


+ s/E- 


-V 


C 


2k x 




2y/E 




A 


ki +k 2 


VF + 


yjE- 


V 



(4.19) 
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(The absolute values of A, B, C may be fixed by normalizing i//.) In the 
limit E -* °°, B = and C = A. Quantum mechanics is then in agreement 
with classical mechanics and all particles are transmitted. 

(b) When E < V then classically no particles would be transmitted. 
This is also true in quantum mechanics. In this case k\ is negative. Only 
the positive imaginary root is allowed as the negative imaginary value for 
k 2 represents a divergent solution that increases exponentially with x. 
That is D is again zero. Equations (4.18) and (4.19) still apply (with 
k 2 = i I k 2 1) and B/A is a complex number of modulus unity. Then the 
reflected wave has an intensity \B/A\ 2 = 1 corresponding to a reflection 
of all particles. Even so there is a non-zero probability that a- particle will 
penetrate the barrier to a depth x. From (4.17a) this probability is 

| *(x) I 2 = \A I 2 ^r ex P [-2y/2m(V - E)x/h] x>0. 

Classically this could imply that the particle has negative kinetic energy 
for x > 0. This surprising unphysical result is explained in quantum 
mechanics by considering the uncertainty principle (Chapter 5). 

A useful limiting case occurs as the potential step is increased without 
bound. From (4.18) and (4.19) 

lim B = -A and lim C = 0. 

K -—~ V a --- 

In this case the wave function becomes 

4,(x) = 2iAsink l x x<0, «K*) =0 x>0. (4.20) 

The wave function vanishes at a boundary where the potential is 
infinite. This result is valid in three dimensions. 

f 
Tunnel effect 

Another problem where the energy spectrum is continuous occurs in the 
motion of particles past a potential hill. As an example consider a 
rectangular potential barrier 

V(x) = x < / 

= V 0<x<a 
= x>a. (4-21) 

Again solutions exist only for E > 0. (Note: the minimum potential 
energy is zero.) 

The most interesting solutions occur when the energy of the particles, 
incident from the left, is less than the height of the potential barrier. 
Classically none of these particles would be transmitted but this is not 
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V 




Potential 

— — — Real part of 
\ji(x) when 
E< V 

Figure 4.2 Rectangular potential barrier. 

the case in quantum mechanics. The Schrodinger equations to be solved 
are 



n 2 d 2 \p , x <o 

- =■ = E\b 

2m dx 2 x >a 




fi 2 d 2 ip 
2m cbr 


< x < a. 


When E < V , the solutions are 




4>(x)=Ae ikx +Be- ikx 


x<0 


Hx) = Ce ax +De- ax 


0<x<a 


4>(x) = Fe ikx 


x>a 


with 




k=-(2mE) 1 ' 2 and 


a - — [2m 
n 



(4.22) 



(4.23) 



a is real. Note that there is no reflected wave for x > a and so there is no 
term involving e~ ikx . 

The boundary conditions are that the wave function and its derivative 
be continuous at x = and x = a. These determine the values of the 
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constants relative to A. (As in the previous problem the absolute values 
may be fixed by normalizing V-) The continuity of \p and its derivative 
at x = give 

A+B=C+D 

ik(A-B) = a(C-D). 
Similarly at x = a 

Ce aa + De- aa = Fe ika 

a(Ce aa -De-°"') = ikFe" ca . 
These equations can be written 

BCD 



A A 






, B CD., 
-ik — -a — + a — = -ik 

AAA 



e aa — + e 
A 



,—aa 



D 



F 
■e ,ka — = 



ae aa £_ ae 
A 

Solving for F/A 

F 



n F 

- aa —-ike ika - = 0. 

A A 



4iak 



A e ika [Aiak cosh aa - (a 2 - k 1 )2 sinh aa] ' 



(4.24) 



The number of particles transmitted through the barrier is proportional 
to|F/>l| 2 ,i.e. 

F\ 2 4 

A 



4cosh 2 aa+( — i sinh 2 aa 

If aa is large then both cosh aa and sinh aa behave as e aa l2 and 



F_ 
A 



16a 2 k 



.2 7,2 



-2aa 



~(a 2 +k 2 ) 2C * 
The number of particles transmitted falls off exponentially as 

11/2 



(4.25) 



exp 



>.-o 



Note that in the case that h can be neglected (i.e. h -* 0) the classical 
limit of no transmission is obtained. 
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Figure 4.3 (a) The potential function for an a-particle in a uranium nucleus, 
(b) The potential function in a crystal. 



Tunnelling of particles occurs in radioactive decay. In the case of 
uranium an a-particle experiences the repulsive nuclear electrostatic field 
up to about 10 -14 m from the nucleus. Inside this distance the forces 
are attractive and the a-particle is in a potential well. It is observed that 
a-particles are emitted with energies less than that necessary to penetrate 
the barrier of the nuclear field. For example in the case of "fU the 
height of the potential barrier is about 30 MeV whereas the kinetic 
energy of the emitted a-particles is only 4-2 MeV. 

The smaller the mass of a particle the greater the probability of 
transmission. This explains why in metals electrons can move from atom 
to atom through the potential barriers. If the barrier height is taken 
~ 1 eV and a ~5 A the barrier is not very transparent and the electron 
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has a probability of about 10" 2 of penetrating at each 'collision' with 
the barrier. However if it is considered that the electron is oscillating 
inside the potential well with wave-length equal to twice the Bohr radius 
(a = 5-32 x 10 -n m) then from de Broglie's hypothesis (2.42) the 
electron velocity will be hj2m a and will collide h\2m a\ ~ 10 17 times 
per second with the barrier. Such an electron will escape through the 
barrier after ~10~ 1S seconds. Clearly it cannot be considered that such 
electrons are bound to particular atoms but rather that they travel 
throughout the crystal. 

In the case when E > V it can be shown in a similar manner to the 
above analysis, unlike the classical case, that not all particles are trans- 
mitted. Some are reflected. 



Harmonic oscillator 

Consider a particle of mass m oscillating in one dimension. Classically if 
the force acting is - kx then the potential energy of the system iskx /2. 
The Hamiltonian operator is 

JT = - — -^- + |*x a (4-26) 

2m dx 2 2 



and the time-independent Schrodinger equation is 
ft 2 d 2 i// 1 



l+itoc 2 *^. (4-27) 



2m dx 2 2 
It is usual to introduce the non-dimensional quantities 

x (4.28) 




-( 



rnkV' 4 



and then 

£*+ (,_/)*.. a (4.29) 

The potential energy tends to infinity as | x I -> °° and its least value 
is zero. Solutions exist for E > and the wave function must vanish 
as | jc I -*■ °°. It is well known that (4.29) possesses solutions which tend 
to zero as |jc | -> oo if and only if y takes one of the values (1 + 2n) where 
n is a positive integer. The corresponding solutions are 

*{y)=Ae- y *l 2 H n (y) (4-30) 

where H„(y) is the Hermite polynomial of degree n. (See Special functions 
of mathematical physics and chemistry, by Sneddon.) The energy spectrum 
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is discrete and so the eigenfunctions are quadratically integrable. A is a 
normalization constant. The first four polynomials are 

H 1 (y) = 2y 

H 2 (y) = 4y 2 -2 

H 3 (y) = 8y 3 -12y. (4.31) 

The wave function ip n (y) is odd or even as n is and has n nodes. 

Corresponding to the allowed values of 7 the discrete eigenvalues are 



1 \ fk 
E n = \n+-\fil- n = 0,1,2,... . 

2 Vm 



(4.32) 



The analogous classical oscillator has an angular frequency a> = (k/m) in 
and in terms of this frequency 



E„ = (n + $)hu> n = 0,1,2,.... 



(4.33) 
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Figure 4.4 Probability distribution for « = 12. The classical value is represented 
by the dashed curve. 
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The term nftco gives the Planck series of energy levels; the energy of 
the ground state is not zero but takes the value ftcj/2. This is called the 
'zero point' energy and is a manifestation of the uncertainty principle to 
be discussed in the next chapter. Classically a particle oscillator would be 
in the lowest energy state when at rest at the centre of the oscillation. 
But this gives a precise value to x and in quantum mechanics it is not 
then possible to give precise values to momentum and energy. For large 
quantum numbers the wave function probability density approximates 
to the classical probability curve for an oscillator with the same total 

energy. 

In this classical case the probability is least at the midpoint of the 
oscillation where the particle velocity is greatest. This is an illustration 
of the correspondence principle discussed in Chapter 2. 

All the oscillator eigenfunctions, except the ground state, have one or 
more nodes. The presence of nodes in a wave function leads to a classical 
paradox. As the probability density is zero at a node then apparently the 
particle must pass from one side to the other at infinite speed as it 
oscillates. The quantum mechanical answer to the 'nod« paradox' is that 
the momentum and position of a particle cannot be known simultaneously 
and so this trajectory problem does not arise. (For a further discussion 
see Kemble, p. 86.) 

The simplest application of the linear oscillator is in the discussion of 
the vibrational energy levels of diatomic molecules. 



4.3 Three-Dimensional Problems 



Particle in a box 

Consider a cube of side a with the origin of the co-ordinate system at a 
corner of the box. Inside the box the potential energy of an electron is 
zero and outside the potential is infinite. This is essentially the model 
for a metal suggested by Sommerfeld in 1928. 
The time-independent Schrodinger equation is 

5l* + 2!* + ?!* +Jk 2,.o (4-34) 

dx 2 dy 2 9z 2 

inside the box, where the electron kinetic energy is 

E~h 2 k 2 \7m. (4-35) 

As E > then k must be real. 

The partial differential equation (4.34) is solved by the method of 
separation of variables. By writing 

xKx,y,z) = X(x)Y(y)Z(z) (4-36) 
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equation (4.34) becomes 

1 d 2 X 1 d 2 Y 1 d 2 Z ,, 

The first term does not include y or z, the second does not involve x or z 
and the third does not involve x or j. This means that 

+ kl=0 



X dx 2 

1 d 2 Y 
Y dy 2 ^" y 

Z dz 2 



+ A: 2 =0 

+ *|=0 (4.38) 



where k x , k y , k z are constants such that 

k 2 =k 2 +k 2 +k 2 . (4.39) 

Each of the equations (4.38) has a general solution of the form 
X(x) = A sin k x x + 5 cos k x x k 2 >0 (a) 

Z(x)=^x+5 k 2 =0 (b) 

X(x) = >1 sinh | k x \ x + B cosh | k x | x kl<0 (c) (4.40) 

where A, B are arbitrary constants and of course are not the same in the 
three cases (a), (b) and (c). 

The boundary conditions require that the wave function vanishes at 
the box walls where the potential is infinite. The only solution which 
can satisfy these conditions is (4.40a), and the wave function (4.36) is 
then composed of terms of the type 

sin sin sin 

k x x k y y k z z. (4.41) 

cos cos cos 

This wave function must vanish at all points on the cube surface and the 
only product satisfying this condition is 

A sin k x x sin k y y sin k z z 

with 

_ In mn _nn 

a a a 
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where I, m, n are integers. The electron energy takes the discrete values 

£ = -^(/ 2 + m 2 +n 2 ). (4-43) 

8ma 

It is very important to note that it is the boundary conditions that 
quantize the energy. In general the energy spectrum is partly or wholly 
discrete if a region of space is surrounded by a potential wall. In the 
present problem the electron is localized within a cube by an infinite 
potential barrier whereas in the previous problem (oscillator) V(x) -*• °° 
as |x| -+ °°. The energy spectrum is wholly discrete in both these cases of 
a finite and infinite domain. 

However if the potential is such that V(x) < inside some closed 
region and V(x) = outside this region there is a discrete set of bound 
states for E < and a continuous spectrum of states for E > 0. The 
wave functions for these latter states are not quadratically integrable and 
represent 'infinite' motion of the system. 

More generally, if the potential tends to a finite value at infinity there 
will be a continuous energy spectrum for values greater than this. 

This problem is the basis of the free-electron model for solids. 
Although this application is apparently crude it is surprisingly useful. 



V(x) 




Figure 4.5 A finite potential well. 
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Rigid rotator 

It is convenient at this stage to discuss the system of two interacting 
point particles the results of which will not only be needed in the present 
problem but also in the following treatment of the hydrogen atom. 
Suppose the particles have masses m u m 2 and are at the positions (* 1; 
Ji,zi)and (x 2 , y 2 ,z 2 ) respectively and suppose that the potential energy 
is a function of the relative displacement of the particles, i.e. 



V= V(x 2 -x u y 2 -y u z 2 -z,). 
The classical Hamiltonian for this two-body problem is 
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where P u P 2 are the momenta of the particles. The corresponding 
operator is obtained by substituting 
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The time-independent Schrodinger equation is 
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(4.45) 



where 12 = Sl(x l ,yi,z 1 ,x 2 ,y 2 ,z 2 )andE is the energy of the system. 

When the potential takes the form (4.44) this equation can be 
separated into two equations one of which represents the system trans- 
lational motion and the other the relative motion of the two particles. 
This separation is obtained by introducing six new co-ordinates, first of 
all the co-ordinates of the centre of mass of the system (x, y, z) and 
secondly the co-ordinates of the second particle relative to the first. 
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(4.46) 
It is left to the reader to confirm that (4.45) becomes 
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where the 'reduced mass' of the system is 
m l m 2 



/* = ■ 



(4.48) 



mi +m 2 

It is now possible to separate this equation by writing 

a (i, y, % x, y, z) = fix, y, I) * (x, y, z) (4.49) 

When this expression is substituted into (4.47) and the equation divided 
by the product f\p it is found that the left hand side isindependent of 
(x, y, z) and the right hand side is independent of {x, y, z). Each of 
these parts must be equal to a separation constant which is conveniently 
taken as -2E t /h 2 . The resulting two equations are 

Equation (4.50) is simply Schrodinger's equation for a particle of mass 
(m i + m 2 ) moving in field-free space. The parameter E t represents the 
translation energy of the system. Equation (4.5 la) is the Schrodinger 
equation for a particle with mass equal to the reduced mass of the system 
moving in the potential V(x, y, z). The parameter E S = (E- E t ) represents 
the vibrational and rotational energy of the system. It is often useful to 
express (4.51a) in spherical polar co-ordinates. 

jc = r sin cos 0, y=rsin0sin0, z=r cos 0. 

Then 

1 A/ aW\ 1 9 2 * . 1 3 / . A W 

r 2 dr V 9r J + r 2 sin 2 90 2 r 2 sin 30 [ 9 30 

+ ^[E S -V]^=Q. (4.51b) 

Consider now the case of the rigid rotator where the masses are kept 
at a fixed distance a apart and where Fmay be taken as zero. The 
Schrodinger equation (4.51b) then represents a mass moving over the 
surface of a sphere of radius a, i.e. 

1 *i + J_ A L e **) + 2 -L E ^ = (4.52) 

sin 2 30 2 sin 30 \ M J h 2 

where / = na 2 . The wave function depends only upon and <t>. This 
equation may be separated by writing 

iK0,0) = 0(0)$(0) < 4 - 53 ) 
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and dividing by the product ©3>. 
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This can be written as the two equations 

/ + m 2 * = (4.54) 

d0 

d0 2 d0 ft 2 sin 2 

where m 2 is the separation constant. The solution of (4.54) is 

4> = A sin m0 + B cos m0. (4.56) 

Since a wave function must be single-valued and continuous $(0 + 2ir) = 
<I>(0) and so m must be restricted to integer values. If the substitution 
a = cos is used in (4.55) then the ©-equation becomes 

(1 _ a2) l!f_ 2a i® + ( x _^l)e = o -k«<i 

A ~ ' ' - ' (4.57) 
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where 

X = 2/£Vfi 2 . 

This second-order equation has two solutions but in general these are 
both infinite at a = ±1 (i.e. 0=0, ir). Such a solution is not physically 
acceptable. But if X takes one of the values 

X = K(K + 1) K = 0,1,2,... (4.58) 

with 

K>\m\ 

then one of the solutions is finite in the range - 1< a < 1 while the other i: 
not. The equation (4.57) is Legendre's associated equation (see Sneddon's 
book) and the acceptable solution is the associated Legendre function 
FT (a), i.e. 

0(6) =Pk (cos 0). (4.59) 

So the physical restrictions on the wave functions require the energy 
to take the discrete values 



2/ 
This simple model is useful in the theory of band structure of molecules. 



*,- ^V" JC-0,1,2 (4.60) 
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The infra red spectrum of diatomic molecules 

The more rapid electron motion in a molecule can be separated from 
the nuclear motion of a molecule (Born-Oppenheimer approximation) 
and the molecular energy (apart from translational energy) can be con- 
sidered to be composed of three parts 

(a) the electron energy, 

(b) the vibrational energy of the component atoms, 

(c) the rotational energy of the molecule as a whole. 

Each electron energy level has a fine structure of vibrational energy 
levels and these in turn have a rotational fine structure. This section is 
concerned with the fine structure due to the atomic vibrational and 
molecular rotational levels. 

Classically, an accelerating electric charge produces an electromagnetic 
field. Molecular vibrations and rotations involve oscillations of charges 
and produce emission (absorption) of radiation. The frequency of this 
radiation is equal to the frequency of the corresponding molecular 
motion. For diatomic molecules the vibrational frequencies are in the 
infrared region ~10 13 -10 14 Hz and the rotational frequencies are lower. 
Infrared and microwave spectroscopy of gases are powerful methods for 
helping to determine molecular structure. 

A diatomic molecule can be regarded as a dumb-bell rotator-oscillator. 
In the previous discussion of the rigid rotator the potential function 
V(r, d,<t>)m (4.51) was taken to be zero. A simple model for diatomic 
molecules is to assume the simple harmonic potential 

V(f) = hk(r-r f 

where r is the equilibrium separation of the nuclei and k is a force 
constant. A detailed analysis of the rotator-oscillator (e.g. Pauling and 
Wilson, p. 263) shows that to second-order terms the energy levels of 
the system are given by 

E n ,K = (n + i)*« + K{K + 1) ^r - K\K + l) 2 ^p 

n= 0,1,2,... 
£ = 0,1,2,... 

where co is the frequency of oscillation of the molecule and / is the 
moment of inertia. The first term is due to the molecular vibrations 
(4.33) and the second term is due to the rotation of the molecule (4.60). 
The third term is a correction term arising from the centrifugal stretching 
of the non-rigid molecule during the rotation. The corresponding eigen- 
functions are 

*n.K<T. , *) = Rn(rW$ (COS 6) e""* 

and R„(r) has the general form of (4.30). 
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Figure 4.6 The energy levels E„ g for the non-rigid rotator-oscillator. 

The infrared spectrum observed depends upon the allowed energy 
transitions and these are determined by the selection rules. In Chapter 8 
it is shown using the dipole approximation that a transition of a charge 
from a state i to a state / is possible if the integral (8.55) 

jx*rxi dr 

is non-zero where Xi and x/ are the initial and final state functions and r 
is the position of the charge. 

The properties of Hermite functions (Chapter 6, problem 14) restrict 
the allowed transitions for a harmonic oscillator to those between 
neighbouring energy levels so that the selection rule is 

5w=±l. 

Consequently the emission or absorption spectrum of a harmonic 
oscillator should consist of one sharp line with a frequency equal to that 
of the oscillator. 

Analysis of the rotator-oscillator eigenfunctions shows that the 
selection rules are 

5m = 0,±1,±2,... 8K = ±1. 

The transitions with SK = +1 form the .R-branch and those with SAT = —1 
form the P-branch. The infra-red absorption spectrum (no electron 
transition) of heteronuclear molecules such as HC1 conform to these 
rules. Each vibrational spectrum line has a fine structure (Fig. 4.7). 
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Figure 4.7 

Note that the rule 8K = ±1 means that there is no central line to the 

structure. 

There is no infra-red absorption spectrum for homonuclear diatomic 
molecules such as H 2 or 2 . This is of course to be expected as in the 
vibrations of a homonuclear molecule there is no net movement of charge. 

It can be shown that when the electron motion is taken into account 
the above selection rules are correct on the assumption that the electrons 
have zero angular momentum about the nuclear-axis. (2 state.) This is 
the case for almost all diatomic molecules (NO is an exception) in the 
ground state. However, if the electrons do have a non-zero angular 
momentum about the nuclear axis then the transitions with 5£ = are 
allowed and form the Q-branch. 

Transitions involving a change of electron energy are very complex 
and in this case homonuclear molecules can show a vibration-rotation 
spectra. For further information the reader is referred to Introduction to 
Quantum Mechanics, by Pauling and Wilson, and The Wave Mechanics of 
Atoms, Molecules and Ions, by Schutte. 

Central field Coulomb potential 

In this final section of Chapter 4, an outline treatment is given for the 
hydrogen atom problem. The first part of the previous section is of 
importance. Consider the motion of a single electron of mass m i and 
charge -e in the coulomb field of a nucleus of mass m 2 and charge Ze. 
The potential energy of the system depends only upon the separation of 
the particles and is 



V(r) = - 



Ze 2 
4ne r 



where e is the permitting of free space. 

The analysis from (4.44) to (4.51) applies and the energy (excluding 



ine analysis irom {_•*.•***) iu v'*--' 1 ; appu«» <" 1U 
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(4.61) 
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where r, 8, <l> are the polar co-ordinates of the electron relative to the 
nucleus. As m 2 > wij, the reduced mass n is effectively equal to the 
electron mass. This equation may be separated by writing 

yjt (r, B,4>)=R (r)0(0)*O) (4.62) 

and dividing by the triple produce R®&. The separated equations are 

—2+mf^ = (4.63) 

d 2 d0 f rrii ) 

_ + cot ,_ + /(/ + 1) __J_ = o (4 . 64) 



r 2 drV dr \ f, 2 



/ Ze 2 \ 1(1+1)} 



where mf and /(/ + 1) are the separation constants (see rigid rotator). 

As $ must be single-valued and continuous the constant mi must be 
an integer. Equation (4.64) is the transformed Legendre's associated 
equation and as the wave functions must be bounded in the range 
< 9 < n the required solution is 

= P, m '(cos 6) (4.66) 

with / a positive integer or zero and / > | m/ 1. 

The radial equation (4.65) may be solved by writing 



n 4ire ti a 

and changing the independent variable to p = 2ar. Then 



d 2 R 2 dR [ 1 /(/+1) , 
dp 2 p dp { 4 p 2 - l 

Observe that for bound states E s < and a is real. The physical require- 
ments of single-valuedness and continuity have been imposed on the 
wave-function but no boundary conditions have been applied up to this 
point. These boundary conditions are 

(i) \p -»• as r -*■ °° (ii) \p is finite as r-+0. (4.69) 

Equation (4.68) is discussed in books on special functions (e.g. Sneddon), 
and solutions that satisfy R -*■ as p -*■ °° and as p -*■ exist only if /3 is 
an integer n , such that n > / + 1 . The solutions are the Laguerre functions 

Rm(p) = e-" 2 p'Lffi (p) n > I + 1 (4.70) 
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where L%t \ is an associated Laguerre polynomial. The restriction on 
results in a discrete energy spectrum 

Es = -ZV/i/2ft 2 n 2 (47re ) 2 n = 1, 2, . . .. (4.71) 

The quantum number n determines the energy and the degenerate wave- 
functions associated with this energy are 

*«i*,fc «> <t>)=A„ lmi R nl (2ar)Pr(cos 6)j m <* (4.72) 

/ = 0,1 (n-1) 

m t = -l,-Q-l),...,0,...,+l 
where A„ lmi is a normalization constant chosen so that 

J \p*\p dr=l. 

all space 

The number of degenerate wave-functions belonging to the nth energy 
level is 

"2 (2/+l)=n 2 . 
/=o 

The quantum numbers / and m t are related to the angular momentum of 
the system and the degeneracy with respect to mi is associated with the 
rotational symmetry of the atom and occurs in any central field problem 
(see Chapter 9). However, the degeneracy with respect to / is a property 
peculiar to the coulomb potential and is not readily explained in terms 
of the atomic symmetry (see Kemble, Quantum mechanics, p. 312). 

The ground state (lowest state) of the electron occurs when n = 1 and 
the single normalized wave function is 

, ^ 1 ( Z Y 2 -012 hH « e ° 2Zr 

(4.73) 

The next level corresponds to n = 2 and has four degenerate wave 
functions associated with it. 

i3/2 



1 I Z \ 3 
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*2i ± i = -t-f I t- I P e ' p ' 2 sin 6e±ut> - (4 - 74) 
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From the last two it is possible to construct the functions 

(2p*) j I z \3/2 COS0 

= — j= — pe~ pl2 sin 6 (4.75) 

(2p y ) 4v2* \a J S in 

The formula (4.71) gives the basic structure of the atomic line 
spectrum but it does not give the fine structure. This is essentially a 
relativistic phenomenon and is connected with the concept of electron 
spin. 

Positive values of the total energy correspond to unbound states in 
which the electron kinetic energy is greater than the potential energy. 
The energy spectrum is then continuous and the wave functions are not 
normalizable. 



PROBLEMS 

1 Consider the one-dimensional Schrodinger equation (4.1 5). If i// 1( 
i/> 2 are two different eigenfunctions corresponding to the same value of 
the energy show that 

d 2 *i , d 2 ^2 , n 

Integrate the equation to obtain 

V / 2 — Wi = constant. 

ax ax 

If the spectrum is discrete both ty x and i// 2 vanish at infinity. By inte- 
grating a second time show that \pi and \p 2 are linearly dependent. This 
result shows that in a one-dimensional problem the energy levels of a 
discrete spectrum are not degenerate. 

2 Suppose the potential in a one-dimensional Schrodinger equation is 
symmetric, i.e. 

V(-x)=V(x). 

By carrying out the change of variable x -> — x show that in the non- 
degenerate case the eigenfunctions are either odd or even, i.e. 

3 Carry out the analysis of the tunnel effect in the case E > V (4.22). 
Show that contrary to classical predictions some particles are reflected. 

4 With the notation of (4.26) show that the frequency of a classical 
oscillator is cj = (k/m) V2 . From energy considerations show that the 
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maximum displacement of the particle is x - (2E/k) m where E is the 
total energy of the oscillation. 

Define P{x) dx to be the probability that the particle is in the element 
dx This probability must be proportional to time dr spent in dx. Using the 
harmonic solution x = x sin wr together with the relation d f = d x/x show 

that 

dx 

"It-* J 

(See Fig. 4.4.) 

5 The potential in a three-dimensional problem is defined by 
V=-V r<a, K = r>a V >0. 

(r is a spherical polar co-ordinate.) 

Show that the spherically symmetric bound solutions (E < 0) which 
satisfy the boundary conditions 

(i) i// is finite at r = 
(ii) \p and d^/dr are continuous at r = a. 
(iii) ip -*■ as r -> °° 

are of the form 



,/,=-*— r>a "-J— ~ 2 



A j—2mE 

r 



^/=-sin/3r r<a 0= J^( v o +E l 

(m is the electron mass.) 

Show that a necessary condition for a non-trivial solution is 

out = -0a cot (/3a). 

a and /3 must also satisfy. 

Illustrate these conditions graphically. 

Explain why bound solutions exist only if 

J2mV 7r 
J-lT a> 2- 
(The spherically symmetric Laplacian is 

r 2 dr\ r br 
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6 The potential in a one-dimensional problem is defined by 

V = -V \x\<a, V = \x\>a, V >0. 

The correct boundary conditions on the eigenfunctions are 

(i) \jj .-*■ as \x | ->• °° (Bound states). 

(ii) \jj and d\j/jdx are continuous at \x\ =a. 

Solve the time-independent Schrodinger equation. 
Show that if 



J 



2m\E\ 

— a>\ 



then the simplified boundary conditions i// = at \x\=a may be used. 
Verify that the eigenfunctions are either odd or even. 

7 The potential inside a cylinder of radius a and length I is - V (V > 0) 
and is zero outside. 

The co-ordinate axes may be chosen so that the z-axis coincides with 
that of the cylinder with the origin at the centre of the lower plane end. 
Schrodinger's equation in cylindrical polars (p, <f>, z) is 



_*1 

2m 



j__9_/ d_\ _1 
p 9p \ dpj p- 



2 d<A 2 " + 3z 2 



ip-[V o +E]xk=0 



p<a 0<z<l 0<4><2n. 



For \E\ sufficiently large the boundary conditions reduce to \j/ = on 
the cylinder boundary. Use the method of separation of variables to 
show that the solutions finite and single-valued within the cylinder are 
of the form 

. nnz cos r0 

\p=Asm——J r (ap) . 

/ sin r<j> 

where n is a positive integer, r is a positive integer or zero and J r (aa) = 0. 
Find the corresponding eigenvalues. (The Bessel function J r (x) is a 
solution of 

d.y 2 1 dj> / r 2 \ n 
dx 2 x dx \ x 2 y 

8 Suppose two interacting particles have masses m \ and m 2 and are at 
the positions (*i,yi,z{) and (x 2 ,y2, z 2 ) respectively, and suppose that 
the potential energy is a function of the distance between the particles. 
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The classical Lagrangian is 

L = f(x\ + y\ + zl) + f (xl+yl + zl)-V. 

Carry out the change of variables (4.46). Find the momenta conjugate to 
the six new co-ordinates and derive the classical Hamiltonian 

where ju is the reduced mass (4.48). 

9 The classical Hamiltonian for a rigid rotator is (see Chapter 1). 

where / is the moment of inertia. An alternative way of writing this is 

^TT^T [(sin0P e ) 2 +4]. 
21 sin 8 

Carry out the quantum substitutions 

ft 3 

Which one of the above Hamiltonians gives the correct Hamiltonian 
operator used in (4.52)? Verify that the correct operator is Hermitian. 
Comment on the results of this question. 

10 In the solution of the simple harmonic oscillator the wave function 
can be written in the form (4.30) where H(y) satisfies Hermite's 
differential equation 

H"-2yH' + (y-l)H=0. 

Use the method of solution in series to obtain the first three terms of 
the two linearly independent solutions. 

The physically acceptable solution must be composed of a finite 
number of terms. Find this solution and explain why 

7 = l+2ra, n=0,l,2,... 
if y n is to be the highest power, in the solution. 
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CHAPTER 5 



Quantum Mechanics II 



5. 1 Complete Description of a State 

It has been mentioned that in a quantum mechanical system not all 
observables are simultaneously measurable. The x-co-ordinate of a 
particle and the x-component of its linear momentum cannot both be 
known exactly at the same instant. However there do exist sets of 
observables that can be measured simultaneously. For example it is 
possible to know simultaneously the energy and the total angular 
momentum of a particle in a central field. 

The state of a system is known as completely as possible when the 
values of all the observables that can be simultaneously measured are 
known. 

Compatibility 

If two observables A andfi are simultaneously measurable in a particular 
state then if either is measured, an unique result is obtained. This implies 
that the state function is an eigenfunction of the representative operators 
a and 0, (Postulates four and five). 

The observables A and B are said to be compatible if there exists a 
complete set of state functions, each of which is an eigenfunction of the 
corresponding operators. 

The following two theorems illustrate the connection between com- 
patible observables and commuting operators and lead naturally to a 
discussion of degeneracy. 

Theorem 1 

If the operators a and have a complete set of simultaneous eigen- 
f unctions then the operators commute. 
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Proof 

Let *,- be one of the complete set of eigenfunctions of the operators a 
and with corresponding eigenvalues a t and 6/ 

a*, =a,-*,- 0*,=Z>,*,-. (5.1) 

Since the set of simultaneous eigenfunctions {*,- } is complete an arbitrary 
wave-function can be expanded 

*=2>/*/. (5.2) 

/' 

Consider the effect of the commutator a0 — 0a on *. 

(o0 - 0a)* = (a0 - Mlcjty. (5.3) 

/ 

From (5.1) this can be written 

(a0 - 0a) * =2.Cj(bjaj - a/fy)*/ 

= 0. (5-4) 

Since * is arbitrary then 

(a0-0a) = O. (5.5) 

The commutator representing the co-ordinate q t and its conjugate 
momentum p f is 



fl 9 

Qi,— 1 — 
i dq t 



= ih (5.6) 

and clearly #,• andp,- are not compatible. 



Theorem 2 

If the operators a and commute, there exists a complete set of functions 
which are simultaneously the eigenfunctions for both a and 0. This 
implies that the observables A and B are compatible and this theorem is 
the converse of theorem one. 



Proof 

Consider first the case when the operator a has non-degenerate eigen- 
values. It will be shown that every eigenfunction of a is also an eigen- 
function of if a and commute. Let *< be one of the complete set of 
non-degenerate eigenfunctions for the operator a, 

a*,-=fl,*,-. (5.7) 
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Operate on this equation with and as a and commute 

The function 00/ is an eigenfunction of a with eigenvalue a t . As the 
eigenstates of a are by assumption non-degenerate then clearly 

00,- = (constant) 0,-. ( 5 - 9 ) 

This implies that 0,- is an eigenfunction of and the result is proved. 

Suppose now that a has degenerate eigenvalues. Let 0,- be one of the 
n linearly independent eigenfunctions spanning the n-dimensional sub- 
space associated with the degenerate eigenvalue a. 

a\\i i =a^/ i i = 1,...,h. ( 5 - 10 ) 

Let this sub-space be V(a). Any eigenfunction of a with eigenvalue a 
must lie in this sub-space and vice-versa. Operate on (5.10) with and 
as a and commute 

aWd B aWd i = l,...,n ( 5 - n ) 

(00,) is an eigenfunction of a with eigenvalue a and must belong to V(a). 

00,= !««*/• (5 - 12) 

That is, V(a) is 'invariant' under and this is true for each eigenvalue 
sub-space of a. 

Since the eigenfunctions of a form a complete set then any eigen- 
function of 0, with eigenvalue b say, can be expanded 

2 *i*i + * (5 - 13) 

where the sum represents a function in V(a) and is a linear combination 
of the eigenfunctions of a outside V(a). Since each eigenvalue sub-space 
of a is invariant under then the linearly independent functions 

x=iwi (5 - 14) 

and are each eigenfunctions of 0. 

00 = 60 (5.15) 

X = 6X- < 516) 

From (5.14) x is simultaneously an eigenfunction of a with 

ax=ax- (5.17) 
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The function <t> can be further decomposed into its linearly indepen- 
dent components in the other eigenvalue sub-spaces of a, although some 
of these components could be zero. So each of the eigenfunctions of /3 
can be constructed from a single eigenvalue sub-space of a. Since is an 
operator representing an observable its eigenfunctions form a complete 
set and the theorem is proved. 

In particular there must be n simultaneous eigenfunctions of a and 
that span V(a). They all correspond to the eigenvalue a of a but in 
general will belong to different eigenvalues of 0. 

As an illustration, consider the operators representing the three 
cartesian components of linear momentum of a particle. 

hd_ nd_ hd_ 

i &e' i by' i dz- (5 - 18) 

These operators commute with one another and by theorem two there 
must be a complete set of functions which are simultaneously eigen- 
functions of all three. The eigenfunctions are 

[i \ Px 

Hx, y, z) = expl- (p x x + p y y + p z z)\ -°°<p y <«> (5.19) 

*■ 1 Pz 

where p x , p y , p z are the eigenvalues. 

Degeneracy 

For a free particle the wave- function (5.19) satisfies Schrodinger's time- 
independent equation and is a possible state function. The values of the 
momentum components p x >P y ,Pz determine the wave-function, apart 
from the usual arbitrary wave factor, and the three compatible momen- 
tum observables form what is called a complete set. 

The commutation of operators is closely related to the occurrence of 
degeneracies. In proving theorem two it was shown that from the set of 
eigenfunctions 4i u . . ., \p n of a which span the M-dimensional sub-space 
of the degenerate eigenvalue a, it is possible to construct n linearly 
independent functions 

n 

X / = .2*/*i j=l,...,n (5 .20) [(5. 14)] 

which are simultaneously eigenfunctions of a and |3 if the commutator 
[a, j3] is zero. 

0iX'=ax' (5.21)[(5.17)] 

Px'=b'x'- (5.22)[(5.16)] 

The superscript / is introduced to distinguish the «-eigenvalues b'\ 
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As the operators a, commute, the associated observables A and B 
are compatible and may be measured simultaneously. If on such a 
measurement the values a, V are obtained then immediately after, the 
state function is essentially x' • If the eigenvalue b' is non-degenerate the 
wave-function is determined completely. On the other hand, if b> is, say, 
w-fold degenerate (m <w)x ; must be a function in the associated m- 
dimensional sub-space. In this case, experience shows that another 
observable C may be found whose operator y commutes with both a and 
and so may also be measured. After this measurement is carried out the 
wave-function is specified more completely. In this way the dimensionality 
of the sub-space, in which the wave-function is known to be, is reduced. 
More compatible observables may be found and measured until the wave- 
function is specified completely. 

Eventually a 'complete set' of commuting observables is found so that 
when the eigenvalues of each are specified, the wave-function is known 
completely, apart from a phase factor. No other observable can be 
measured unless it is a function of those already in the set. 

For a given system there may be several distinct complete sets of 
observables. All the members of one such set do not commute with all 
the other members of any other set although they may have some 
members in common. 

There is a very interesting alternative but equivalent way of discussing 
degeneracies. This involves a study of the symmetry properties of the 
system and is dealt with in Chapter 9. 



Angular momentum quantum numbers 

It was shown in section 4.3 that the motion of an electron in the field of 
a nucleus is essentially described by the Schrodinger equation 

_ _*? v a + v(r) ^ =E ^ y (5.23)[(4.61)1 

2m 

where the nucleus is at the origin, u is the reduced electron mass and E is 
the internal energy of the system. The potential of interaction is spheri- 
cally symmetric and is a function of r only. It will be shown below that 
in this central field problem three observables form a complete set. 

Classically, the angular momentum about the origin of a particle with 
linear momentum p at the position r is 

L = rAp. (5-24) 

The z-component of this orbital angular momentum is 

L z =xp y - yp x . ( 5 - 25 ) 
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The quantum mechanical operator £P Z representing this observable is 
obtained using the substitutions (3.8). 



*-7l*» 



dx 



(5.26) 



(As there are no non-commuting factors in (5.26) there is no ambiguity.) 
Similarly, operators may be obtained for the other components. 



x i V dz dy 



-S?, 



■?(■ 



dx dz 



(5.27) 



It is easily shown that the operators (5.26), (5.27), do not commute with 
one another and so the observables L x ,L y ,L z are not compatible 
(Theorem 1). For example 



•J v awC i> 



^y x ~ \ i 



LL_ X *]L±. 

y dx dz) \ dz 



dz dy 



dy 



b_ d_ 
dx dz 



i.e. 



J, X ~£*y J* y *t?x '" "^2 • 



(5.28) 



Similarly, 

jsf z se x - se x se z = m se y . (5 .29) 

However, all three angular momentum operators do commute with 
the Hamiltonian of (5.23). In particular, if all derivatives concerned are 
continuous, 



<e z v 2 - v 2 .s? z = o 



(5.30) 



and 



w-FF 



■>* FW 



d£2. 
dr r 



dVx 
dr r 



♦<*£■ 



dx , 
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i.e. 

se,v(f)-v{r)se t =o. ( 5 - 31 ) 

As the Hamiltonian operator is 

h 2 
je=-—V 2 + V(r) 
2m 

then 

J?,Jf- jfJSf.sO. (5-32) 

From theorem two the sets of observables (H, L x ), (H, L y ), (//, L z ) are 
compatible. 

The magnitude of the total angular momentum is 

L 2 =L 2 +L 2 +L 2 (5.33) 

and the operator representing this observable is 

je 2 =# 2 x +tf 2 y +&l (5-34) 

As Sf x , <g y ,& 2 all commute with Jf then so does SP 2 , e.g. 

:. & 2 tf-jrse 2 =o. (5.35) 

Similarly £? x , & y , <£z all commute with-S? 2 , e.g. 

<e z <e' 1 = st,sel + se,s? y + st % se\ 

= (ih^ y + ^ x SC z )^ x +(-ih^ x + Sf y ^ z )Sfy +&1 
= iM&y #x - tfx &y) + &x(#x &+& &y) 

+ &y(&y&z -ih& x ) + &i 

= w& x -** z " ^ v "^z ^-Z'2r 

i.e. 

& z & 2 -& 2 & z =0. ( 5 - 36 > 

Clearly, then the sets of observables 

#,I 2 ,£* (0 

//; Z, 2 , Z, y (ii) 

H, L 2 , L z (hi) ( 5 - 37 ) 

are compatible and in fact each is complete. 

As in section 4.3 it is useful to work in spherical polar co-ordinates 
defined by 

x =r sinfl cos <p j>=rsin0sin0 z=r cos 6. (5.38) 
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The operators (5.26), (5.27) and (5.34) become 



it x = — — I sin — - + cot cos 
i \ 90 



*h) 



h 9 „ 9 

T -cos — + cot sin — 
z \ 90 90 



h 9 

^^UiLi),! ill (5 . 40) 

| sin 90 \ 90/ sin 2 90 2 J v ' 

The eigenfunctions of the Hamiltonian for central field problems are 
essentially 

**«,(/•. 9, 0) = R„,(r)Pr >(cos d)e im '*. (5.41) [(4.72)] 

(In the special case of a Coulomb potential the radial part may be 
identified as a Laguerre function (4.70), and the energy depends only 
upon n). 

Clearly these energy eigenfunctions are also eigenfunctions of the z- 
momentum operator <£ z with eigenvalues timi. m t is called the magnetic 
quantum number. Similarly by comparing (5 .40) with (4.64) it is seen 
that ip n i ml is an eigenfunction of the total angular momentum operator 
& 2 with eigenvalue h 2 l(l + 1). / is called the angular momentum number. 
Measurement of the observablesZ, 2 and L z determines the angular part 
of the wave-function and measurement of the energy determines the 
radial part. 

Observe that \pnimi is not an eigenfunction of the operators Z£ x , 3? y 
except in the case that the quantum number / = 0. Then the wave-function 
depends upon r only and £f x , & y , £f z have the common eigenvalue zero. 
(Note that the commutation rules(5.28), (5.29) are satisfied.) 



5.2 Constants of the Motion 

It is well known that there are important conservation laws in classical 
mechanics. The rate of change of a classical variable F{g u . . ., q„, 
p u ... p„,t) is given by 

dF dF 

-={F,tf} + - (1.34) 

where {F, H} is the Poisson bracket of F and the Hamiltonian. When# 
does not depend explicitly on the time dH/dt = and the Hamiltonian is 
a constant of the motion. Any time independent variable of the system 
whose Poisson bracket with the Hamiltonian vanishes, is conserved. 
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Another example is the conservation of angular momentum of a particle 
moving in a central force field. The Hamiltonian is 

2m 

where the potential energy is spherical symmetric when the origin is 
suitably chosen. The z-component of angular momentum is 

L z =xp y -yp x 
and it is a simple matter to show that the Poisson bracket 

{L z ,H} = 0, ( 5 - 43 ) 

and so L z is a constant of the motion. A similar argument applies for L x 
and L y and consequently the vector angular momentum is conserved. 
There are analogous results in quantum mechanics. In a given state 
described by a normalized wave-function* (4 !, ...q„, t) a general 
observable does not have a definite value. The expectation value of the 
observable, A is given by 

U> =[**<** dr ( 3 - 21 ) 

where a is the representative operator. This expectation value may not 
be constant in time and the rate at which it changes is given by 



c\t J 



3a 9** 9* 

<y* — q, + ___ a \Jr + \fr* a — 

dt dt dt 



dr. (5.44) 



The state function must satisfy Schrodinger's time-dependent equation. 

22=_ijf¥ **! = + l<^)* (5.45)[(3.10)] 

dt ft dt ft 

and as the Hamiltonian operator is Hermitian 

f (JT ¥)*(a¥) At = J ¥*.?ra¥ dr. (5 .46) 

Substitution of (5.45) and (5.46) into (5.44) gives 

-U>=[**(- + -[^a-«^])*dr. (5.47) 

dt J [dt ft j 

This equation is the quantum analogue of (1 .34) and may be used to 
define an operator representing the time dervative of A, 

2H + i(jr«-aJf). (5.48) 

dt ft 
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This illustrates a point already mentioned (3.6) that in the transition 

from classical mechanics to quantum mechanics Poisson brackets are 

replaced by commutators. 

If the operator a does not depend explicitly on the time and if it also 

commutes with the Hamiltonian of the system 

d 

— (A) = 

dt 

and the expectation value of the observable A is a constant of the 
motion, A is said to conserved. In particular, if the state function is also 
an eigenfunction of a, then A has the definite value equal to the eigen- 
value in that state. The state function is an eigenfunction of any given 
representative operator immediately after a measurement of the appro- 
priate observable. However the state function will remain an eigen- 
function only if the observable is conserved and the associated quantum 
numbers are called 'good quantum numbers'. 

For most systems the Hamiltonian does not depend explicitly on the 
time and its expectation value is a constant. 

If the state function is an eigenfunction of the Hamiltonian operator 
at some instant it will remain so. Such a state is called a stationary state 
and the wave-function has the form 

*(<?!, . . ., q n , t) = vK<7i, • • •, q n )e- tEtlh (5.49) 

It is always possible to measure conserved observables simultaneously 
with the Hamiltonian. In the central field problem the observables (H, 
L 2 , L z ) form a complete set with values which are constants of the 
motion. The quantum numbers n, I, mi are good quantum numbers. 



5.3 Uncertainty principle 

Consider two observables A and B with representative operators a and (3. 
If a and commute both observables may be measured simultaneously. 
If the operators a, do not commute they do not have a complete set of 
simultaneous eigenfunctions, (Theorem one). Define y by 

(p^-Pa) = iy. (5.50) 

There are two possible cases. 

(i) If y is a Hermitian operator it may be possible to measure A and B 
simultaneously in certain states. If i/»,- is a simultaneous eigenfunction 
of a and j3 then it must also be an eigenfunction of 7 with zero eigen- 
value. Even though there may be an infinite number of such eigen- 
functions they cannot form a complete set. 

(ii) If y is a real constant then 

•yv//,- = 
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has no non-trivial solutions and a, (3 do not have any eigenfunctions 
in common. A well-known case is that of the position co-ordinate 
and conjugate momentum 

(QiPi - PiQt)-^- 
The position and momentum of a particle can never be known 
simultaneously. Any measurement of the position must disturb 
the momentum and vice-versa. 

Even so, it is possible to obtain a relation between the uncertainties in 

the measurements. 

Schwarz inequality 

If ij/j and \p 2 are two quadratically integrable functions then 

[J ***i dr][J^J^ 3 dr]>| J^J^! dr|*. (5.51) 

This is analogous to the result 

|A| 2 |B| 2 >|A.B| 2 

where A, B are vectors in three-dimensional Euclidean space. Hilbert 
space is a unitary space and the integral / \p*^i dT represents the 
'scalar' product of i//* and \p j. 

Proof 

Let A be a real constant defined such that the function Xi=Mi is 
normalized to unity. 

>l 2 »l/Jl* a l 2 dT. 
Consider the function 

*«*i-Xa{x**idT. (5.52) 

Since /I* I 2 dr>0 then 

JhM 2 dr-2|J X J*i dr| 2 + |{ X J*i dr| 2 >0 
i.e. 

\ l^,l 2 dr>|J X 2^idr| 2 . 
Substitute for xi to obtain 

Jl^ 1 l 2 dr/|^ 2 | 2 dr>|J^ 1 dr| 2 . (5.53) 
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Uncertainty relation 

When a system is in a state described by the normalized function 1// the 
uncertainty (AA) is a measurement of the observable A is defined by 

(AA) 2 =^*[a-(A)] 2 *dT (5.54) 

where the expectation value of A is defined as usual by 

G4> = J ip*a\l/ dr 

(5.54) may be expressed in another form by expanding \p in terms of the 
eigenfunctions of a. Writing 



^ = 1cj4/f with m//;=a,-i//,- 
/ 

(AA) 2 = I\c i \ 2 [a i -U)]'- 



(AA) is the variance of the possible results of measuring A. 
In the Schwarz inequality write 

i//j = [a-C4>]i// and i// 2 = [0-<5>]tf/ 
where is the operator representing the observable A The integral 

J |i//!l 2 dr = J [(o-U >)!//]*[(« -U))"/*] dr 

= J"i//*(a-U>) 2 \!/dr 
since (a — (A >) is Hermitian, i.e. 

j\* 1 \ 2 dT = (AA) 2 . 

Similarly 

j|^/ 2 | 2 dr = (A5) 2 . 

Since the square of the modulus of a complex quantity is greater or 
equal to the square of its imaginary part then 

| j"^! drf >i| J*J*i dr- J*?* 2 dtf 

>i|Ji//*(aj3-j3a)i//dr| 2 . 

The Schwarz inequality (5 .5 1) gives 

(AA) 2 (AB) 2 >\\ JV(a0-0a)«// drf . (5.55) 

If the state function \p is an eigenfunction of both a and /3 then 

(AA) 2 (AB) 2 =0 
as expected. 
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In the more general case when [a, jS] = if 

(AA)(AB)>l\j\l/*y\ljdT\. (5.56) 

This is a general statement of the Heisenberg uncertainty relation. In 
particular, the uncertainty relation for the momentum and position of a 
particle is 

(A?,)(Ap,)>|. (5-57) 

This confirms a point already mentioned in Chapter 4 that when ft can be 
neglected, classical mechanics can be used. 



Incident i 

plane wave ' 



Figure 5.1 Illustration of the uncertainty principle. 

There are several simple experiments which illustrate the uncertainty 
principle. Suppose a particle with a known momentum is incident 
normal to a screen which contains a single slit. The wave-function of the 
particle is the plane wave 

til = e ikx 
and the momentum isp x =hk. (The co-ordinate axes are shown in (5.2).) 
Before reaching the screen, the position of the particle is completely 
uncertain but the component of momentum parallel to the screen is 
known to be zero. After passing through the slit the ^-co-ordinate is 
known with an uncertainty essentially equal to the slit width a, i.e. 



Ay=a. 
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In discovering this information about the particle position an element of 
uncertainty is introduced into the conjugate component of momentum 
p y . After passing through the slit the 'particle wave' undergoes diffrac- 
tion and there is a finite probability for the particle to be deflected 
through an angle A0. It is well known that the first minimum in the 
diffraction pattern occurs when 

A0--. 
a 

A is the 'wave-length' of the particle wave and is equal to hip. A 
measure of the uncertainty in the momentum is clearly 

Ap y — pAO 

h 



AyApy-h. (5.58) 

Note that if the slit is made smaller so that the position is known with 
greater accuracy there is a corresponding increase in the uncertainty of 
the momentum. 

The lowest energy level for a system is consistent with the uncertainty 
principle. For example, consider the particle confined in a cube of side 
a. The position uncertainty Ax ~a implies a momentum uncertainty 
&Px ~ fi/a which in turn implies a minimum energy ~h 2 /ma 2 . This 
should be compared with (4.43) where the lowest non-trivial-state 
corresponds tol = l,m =n=0. 

As interesting applications consider the relative energies of an atomic 
electron and a nuclear a-particle. For the atomic electron a sensible 
characteristic length is 1 A. If Ax ~ 10~ 10 m then Ap x ~ 10 10 h and 
this implies an energy ~10 eV. For the dimension of the nucleus take 
Ax ~ 10" 14 m then Ap x ~ 10 14 h and this implies an energy ~ 1 MeV. 
Atomic electrons do indeed have energies sensibly measured in terms of 
eV and the kinetic energy of emitted a-particles is of the order of MeV. 

5.4 Wave Packets 

A 'wave packet' is a wave-function that attempts to give an almost 
classical description to a particle so that it is possible to give some 
meaning even if uncertain, to the momentum and position simultaneously. 
The one-dimensional wave- function B exp [i(kx — cor)] represents a 
particle with momentum hk and energy ftco. The momentum eigenvalues 
form a continuous spectrum and a superposition of them is represented 
by an integral. 

*(x, r)= j B(k)e^ kx - Wt) dk (5.59) 
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\B(k)\ 2 dk represents the probability that a measurement of the 
momentum will give a value such that 



fi 



Kh<»K 



If the wave-function *(*. i s t0 represent a particle with a characteristic 
momentum it is necessary that B(k) be vanishingly small except for a 
small range of values of k. 

Suppose B(k) has an appreciable value only near k = k . The energy 
and hence the 'frequency' co are functions of fc and may be expanded in 
Taylor series about k 

do; 1 o d 2 co , . 

^) = u 0+ (k-k ) — + -(k-k f^ + ... k~k 



where 

, , dco /dco\ , d 2 co_/d 2 co\ 

"•■ w ^^"ld*J*. and *%-[**)*; 

The wave-functions becomes 

? T /dco 

*(*, « «'<*•* ""•'J J B(k) exp /(* - *„) I * - ^ « 

-i*-*»^'h (5 - 6i) 

If co(fc) is a linear function d 2 co/dfco = and the wave packet does not 
disperse but travels without change of shape. In this case defining 

/(*)= J£(fc)e /(fe - fco) *dfc (5-62) 

then 

*<*, t) ^ f L±± t \ ^(M-,0, (5.63) 

In particular 

#(*, Q) = f(x)e ik ° x 

Since B(k) is appreciable only in the neighbourhood of k = k , f(x) 
must be a relatively slowly varying function of x and is non-zero in the 
neighbourhood of x = 0. Equation (5.63) then represents a plane wave 
with a modulated amplitude 

/ dco 
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and the 'wave packet' moves with a group velocity 

dco 

(5.64) 



Vg djfc- 






a free particle 






2m 


and so 


dco _ hk p Q 
dk m m 



The packet velocity is equal to the velocity corresponding to the mean 
value of the wave number k . 

The wave packet description makes it possible to associate a mean 
position and momentum with a particle but the finite size of the packet 
makes it impossible to specify the position exactly and the spread in the 
fc-values needed to construct the packet make it impossible to specify 
the momentum exactly. This is in accordance with the uncertainty 
principle. 

The above treatment is readily extended to three dimensions and the 
wave packet is then 

oo 

*(r ) = //j5(k)e'' (k - r_u,r) dk (5.65) 

— oo 

with an associated velocity 

V^=grad k w(A:). (5.66) 

Ehrenfest's theorem 

In 1927 Ehrenfest showed that the motion of a wave packet obeys 
Newton's laws. He showed that if < x) and < p) are the average values of 
the position and momentum for the wave packet then 

m—(x) = (p x ) (5.67) 

at 

and if Fis the potential energy of the particle 

d /a v\ 

*<*>-(*/ (5 - 68) 

Proof 

Let *(r, t ) be the state function describing the motion of a particle. 
Assume it is quadratically integrable and normalized to unity. The 
expectation vlue of the x-co-ordinate is defined by 



<x> 



= j"**Jc*dT (5.69) 
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and the average value of the x-component of the velocity is (5.48) 

.1 < x) = - U* [jf x -xjf]* dT. 
At ft J 

The Hamiltonian operator 

2m 
and so 

A <x> = _ A f[**(v 2 x*) - **xV 2 *] dr 
df 2m J 

= _ J*_f**Lv 2 * + 2^ -xV 2 *|dr 

2mJ [ 9* J 

i.e. 

dr m 

The rate of change of the expectation value of the x-component of 
momentum is 






At 



-J-(-s> 



dr (5.71) 



For a sharply defined wave packet <x> and ip x ) can be associated with 
the classical meaning of position and momentum of the particle and 
- < 9 V/bx > can be identified with the classical force component. The 
packet then behaves like a classical particle. 
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5.5 Probability Current Density 

In the previous section it has been shown that the operator representing 
the velocity is the momentum operator divided by the mass, i.e. (tl/im) 
grad (5.70). This operator can be used to find the expectation value of 
the velocity for a given state function. 

A related quantity is the probability current vector S which can be 
used to find the probability that a particle will cross a given surface in 
unit time. Suppose ^(r, r) is the state function representing a one-particle 
system. The probability that the particle is in the finite volume Q, is 

p(Sl) = j **VdT. (5.73) 

a 
The rate of change of this probability is 

dp(fi) f/a** 9*\ 

n 
and as * satisfies Schrodinger's time-dependent equation 

dp(£l) 
At 

The Hamiltonian operator is 

*-- hl 

2m 

and so 



df^ = jj- J" [*(*"*)* - **C^)] dr. 



h2 vUv 



h 2 



*(?f *)* - **pf*) = - — [*V 2 ** - ** V 2 *] . 

Using the vector identity 

div [* grad ** - ** grad ^] = *V 2 ^* - >P*V 2 * 
the rate of change of p(S2) is 



dp (£2) 



t 2m J 



div [^(grad >&*) - V* grad *] dr. 



dt 
A vector function of position S may be defined by 

S = ^- [* grad **-** grad ¥] (5.75) 

and then 

^M = -fdivSdr. (5.76) 

dt J 
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The volume integral can be transformed to a surface integral by Gauss's 
theorem, i.e. 

*£©>=_ *S.dA (5-77) 

where A is the surface enclosing the volume £2 and the direction of dA 
is along the outward normal. Clearly from (5.77), the integral of S over 
the surface A is the probability that the particle will cross the surface 
going outwards in unit time. S is the probability current density. From 
(5.74) and (5.76) 

— (**^) + div S = 0. ( 5 - 78 ) 

This equation is analogous to the equations of continuity of hydro- 
dynamics and electrodynamics. Observe (5.75) that if * is a real wave- 
function, the probability current density is zero. 

Consider the energy eigenfunctions for a plane wave moving in the 
x-direction. The wave-function 



ikx 



represents a density oiN particles per unit volume. The probability 
current density vector has a non-zero component only in the x-direction 
and 

S x = — N. 
m 

This is the expected result as the particles have momentum**: and this 
expression gives the number of particles per second crossing unit area 
perpendicular to the x-axis. 

A more interesting example is obtained by considering the central 
field eigenfunctions. From (5.41) these have the form 

Mr,e,<t>) = f(r.e)e imet> 

where /(r, 0) is a real function of r and such that \p is normalized to 
unity. The vector S has a non-zero component only in the ^-direction 
i.e. in the direction 3r/30 about the z-axis. A simple calculation show 
that 

v prsiaO 

where n is the reduced electron mass. The spherically symmetric s-states 
have zero current density. For thep, d states etc. equation (5.79) 
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represents an electric current circulating about the z-axis and it can be 
shown (Slater, p. 163) that the associated magnetic moment is 

f r , (5.80) 



PROBLEMS 

1 A function of the operator a can be defined by 

/(«)= 2 k„a". 

71=0 

Show that the eigenfunctions of a are also eigenfunctions of /(a) and 
express the eigenvalue of the function operator in terms of the eigenvalues 
of a. 

Explain why if a is Hermitian then /(a) is also Hermitian. 

If the commutator of the two operators a, j3 is zero show that the 
commutator [/(a), 0] is zero also. 

2 The angular momentum about the origin of a particle with linear 
momentum p is given by 

L = r A p. 

By differentiating L with respect to time show that in the case of a 
central field when the force is directed towards the origin that L is a 
constant of the motion. 

3 In the case of a system of two particles the operator representing the 
^-component of the angular momentum about the origin is 

fl ^ / a 



^-i2[ x "^-- y " 



n = 1 \ 

and similarly for & x and jSf y (5.26), (5.27). 

If the masses of the particles are m i , m 2 the x-co-ordinate of the 
centre of mass is 

m^Xx +m 2 x 2 

x = 

mi +m 2 

with similar expressions for y and z (4.46). The co-ordinates of the 
second particle relative to the first are 

ot=(x 2 -x 1 ), (3 = (.V2-7i)> 7 = (z2-zi). 
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Verify that the given expression for & z can be transformed to give 



*.-? 






Explain the significance of the two terms in the angular momentum. 

4 Show that if a is an operator and * t and * 2 are two quadratically 
integrable state functions then 



df 



(Vfa^ dr = J*t 



— + -(jfa-aJfT) 
df h 



* 2 dT. 



(Compare equation (5.47).) 



5 Show that if a and (3 are two Hermitian operators and 4> y and 4< 2 
are two quadratically integrable wave- functions then 

Hence show that 

j" ^[ctf - 0a] 4/ 2 dr = j" [(/& - a/3)i//i] *^ 2 dr. 
Deduce that if an operator y is defined by 

(a0-j3a) = /7 
then y is Hermitian (5.50). 

6 It can be shown that for a linear harmonic oscillator the expectation 
value of the potential energy is equal to the expectation value of the 
kinetic energy. In particular, for the lowest energy state 



— kx 

2 



2m 



"M" 



-if 



As the expectation values of x and p are clearly both zero then the 
uncertainties in x and p are respectively given by 

(Ax) 2 =ji>*x 2 4>dT, (AP) 2= $**(j-t) * dT - 
Show that in the lowest energy state 



(Ax)(Ap) = - 
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in agreement with the certainty principle. Confirm that for the higher 
states 



(Ax)(Ap)>- 

7 The function \pnimi (5 .41) is an eigenfunction of J§? 2 with eigenvalue 
fim. The commutator of 2£ x and S£ y is 



Using equation (5.55) show that 
(AL xX AL y )>^. 



(5.28) 



8 Verify that the result of adding the two travelling waves 
y x =A cos (k\X - cojr) y 2 = A cos (k 2 x - co 2 t) 
is 



18k 5co . 
y = yi + 72 = 2^4 cos I —x - — t J cos (k x — co r) 



where 



k u +k 2 



CJn =' 



CO, +002 



bk ~ k\ — &2> 



5co = CO! — CJ 2 . 

This represents a wave with average frequency oj with a modulated 
amplitude. Verify that the modulation moves with a velocity 5co/6&. 
(Compare (5.63) and (5.64).) 



9 Verify that the wave packet 
A 



*(*, = 



■[■*♦">] 



-,'/( a 2 + -^2) 



is a solution of Schrodinger's time-dependent equation in one-dimension 
for a free particle. Show that the expectation values of the position and 
momentum are both zero and derive the following expression for the 
variance 



(Ax) 2 



fit 



2 \ma 
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This shows that a wave packet (in a dispersive medium where E(k)) 
spreads in time. You may assume the definite integrals, 

oo oo 

J x 2 e~ x * dx = y/H/2 J e~ xl dx = \fH. 



10 Prove the vector identity 

divO grad yfi* - yji* grad <//) = \jj V 2 v|/* - ^*V 2 \p 
where i// is a scalar function of position. 

1 1 Let iinimi be an eigenfunction of the central field problem. The 
current density at a point in space is (from (5.79)) 

• etim i I I |2 

[it sm 6 

The magnetic moment associated with a current carrying loop is equal to 
the current in the loop multiplied by the area of the loop. By first of all 
considering the magnetic moment in an annulus of small cross-section 
area about the z-axis and then summing over all such annuli show that the 
total magnetic moment is 

m eh 
2ju 

12 The Hamiltonian operator is, in spherical polar co-ordinates, 

3\ 1 



fl 2 

2m 


[J_JL( 

r 2 br\ 


1 

"*" 2 • 2 

r sin 


3 2 " 
6 d<t> 2 



dr I r sin 6 



—= +V(r,6,<p) 



9 / • « 9 
— sin 6 — 

dd \ dd 



where m is the particle mass. 

Explain why, if the potential V is independent of 6 and <p the energy 
eigenfunctions can also be chosen to be eigenfunctions of the total 
angular momentum operator JSf 2 . 

Show that the radial part R(r) of the variable separable eigenfunction 
of an energy state with angular momentum quantum number /, satisfies 



1 d / , dR\ fi 2 1(1+1) 



2n r 2 dr \ dr / 2m i 

where E is the state energy. 

Substitutei?(r) = x( r )l r an d obtain 

ft 2 d\ 



R + VR =ER 



2m dr 2 



/(/ + !) 

2 Mr 2 



h 1 + V(r) 



X=EX- 
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Suppose 

V(r) = -V txp[-r/a], V >0 

By substituting z = exp [—r/2a] show that the radial wave-functions for 
bound states with / = are of the form 

R(r) = jj v (ke- r/2a ) 

where A is a constant and 

J v (k) = 0. 
[J v is the Bessel function of order v.] 

References 
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CHAPTER 6 



Matrix Representations 



6.1 Introduction 

In the previous chapters the theory of quantum mechanics has included 
differential operators and differential equations. There is an equivalent 
alternative description in terms of matrices. Matrix mechanics was 
developed by Heisenberg, Born and Jordan and actually preceded the 
Schrodinger wave mechanics already described. In this chapter matrix 
mechanics will be deduced from the Schrodinger approach. 

An eigenvalue problem can be interpreted in terms of solving a set of 
linear equations. Suppose a is a Schrodinger differential operator 
representing an observable A and let ij/ be an eigenfunction with eigen- 1 
value a; 

\j/ can be expanded in terms of the complete set of orthonormal eigen- 
functions belonging to some other operator, say (3; a discrete spectrum 
is assumed, i.e. 

^=2<7<*y with P<t>,=bj(t>i. (6.2) 

With this expansion for ty, equation (6.1) becomes 

J,CjQ.(t>j=a2c i <t> i . ( 6J ) 

/ /' 

If this equation is premultiplied by 4>* and integrated over all the space 
then 

2c ; a//=ac; (6.4) 

/ 
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where 

<*// = J 0*<*0/ dr. 
Equation (6.4) is true for all / and represents a set of linear equations. 

a ll c l + a 12 c 2 + . . . = <IC\ 

a n ci +a 22 c 2 + . . .-ac 2 

"31^1 + a 32 c 2 +...=ac 3 (6.5) 



As the operator j3 has an infinite number of eigenvalues then (6.5) includes 
an infinite number of equations. Sets of equations such as (6.5) can 
conveniently be represented in terms of matrices and it is useful at this 
point to make a brief digression to remind the reader of the essentials of 
matrix algebra. 



6.2 Matrix Algebra 

A matrix is a rectangular array which obeys certain rules of algebra, e.g. 




(6.6) 



otij is called an element of the matrix A, the subscript i denotes the row 
and the subscript/ the column. The matrix (6.6) with m rows and n 
columns is called an 'mbyn' matrix. If the number of rows is equal to 
the number of columns then the matrix is square. 
A matrix 




composed of one column is called a column matrix or a column vector. 
Similarly a matrix with a one row 

(c\, c 2 c n ) 

is called a row matrix or row vector. 
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Two 'w? by w' matrices are equal if and only if corresponding elements 

are separately equal. 

The two fundamental laws of matrix algrebra are addition and 

multiplication. 

Addition and subtraction 

The sum of two i m by n' matrices Tand A with respective elements 7,7 
and \ij is defined by the 'm by ri matrix A with elements 

<%/ = -fy+fy (67) 



and 



e.g. 



a = t+a 




a = t + a 



Addition is only defined for matrices that have the same number of rows 
and the same number of columns. 

Substraction is the reverse of addition and the matrix 

2 = r-A (6-8) 

with 

%■ = lit - \i 

is called the difference of T and A. 

From the definition (6.7) it is obvious that the addition of matrices is 
both commutative and associative, i.e. 

r + A=A+r ( 6 - 9 ) 

and 

(A + r) + A=A + (r + A). (6-10) 

Multiplication 

The product of the matrix T and the constant d is defined to be the 
matrix dTwith elements 

(dl\=d7 iy . (6-11) 
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The product A = TAof two matrices is defined only when the number 
of columns of T is equal to the number of rows of A. The elements of 
the product matrix are given by 

a i j = Zy i i\,j (6.12) 

e.g. 




A=TA= 2 5 6 1. (6.13) 



Observe that the number of rows of A is equal to the number of rows of 
r and that the number of columns of A is equal to the number of 
columns of A. Matrix multiplication is not commutative, i.e. 

TA^Ar in general. (6.14) 

For example 

Ho 1) a {\ >) <6i5) 



then 



< :) 



rA= and AT = 



1 3 
1 2 



However matrix multiplications is associative, i.e. 

(Ar)A = A(TA). (6.16) 

Also multiplication is distributative over addition. 

A(r + A) = Ar + AA (6.17) 

A matrix whose elements are all zero is called the zero or null matrix 
and is denoted by 0. 

An important result is that 

TA=0 

does not necessarily imply 
T=0 or A = 
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e.g. 



(! :)(-; m: : 



It is now clear how sets of equations can be represented by matrices. 
The equations 

a n Ci + a n c 2 + • • • + ai„c„ = <*i 



a ml c l +a m2 c 2 + ■ ■ • + a mn^n ~«n 

can be written 

Ac = a 
where 

/a n ai 2 ...ai„\ 
\a m i 0L m2 ...<X m n) 
and c and a are the column matrices 





c= . a = 



The transpose of a matrix 

Suppose T is an 'm by n' matrix with elements 7,7. The transpose of this 
matrix is an 'n by rrC matrix V such that 

(f) V =7/,- ( 618 > 

The columns of T become the rows of f and vice-versa, e.g. if 

\2 0/ th6n f ^ * ' 
Qearly the transpose of a transpose is the original matrix, i.e. 

(f) = r. (6.19) 
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The transpose of a product is the product of the transposes in 
reverse order. 

(fift,=<r% 
= 2(r)//(A)„ 

= 2(A) l7 (f) // 
= (Af) f/ 

i.e. 

(?A)=Af. (6.20) 

Square matrices 

A matrix that has the same number of rows and columns is called a 
square matrix. 

If a square matrix is equal to its transpose it is called a symmetric 
matrix, i.e. if 

f = f (6.21) 

eg- 




A square matrix whose only non-zero elements lie in the principal 
diagonal is called a diagonal matrix, e.g. 




All diagonal matrices of the same order commute and have diagonal 
products. 

A diagonal matrix whose elements in the principal diagonal are all 
unity is called a unit matrix and is denoted by I. The unit matrix of 
order three is 
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Trace 

The trace (or spur) of a square matrix is the sum of the diagonal elements; 

Trace T=27«. (6 - 22) 

i 

The trace of a product of two matrices does not depend on the order 
of the factors. 



Trace (IVY) = 2 (?7i/ty) = ? (2 Vty) 



i.e. 



Trace (TA) = Trace (AT)- ( 6 - 23) 

The inverse of a matrix 

The inverse of the square matrix T is the matrix T" 1 such that 

rr _1 =r _1 r=i. ( 6 - 24 ) 

This inverse exists if and only if the matrix T is nonsingular; that is if the 
determinant of the elements of T is non-zero. In this case the inverse is 
obtained from T by replacing each element by its cofactor and then 
transposing the resulting matrix and finally dividing it by (det T), e.g. if 

'■(is) - -iU ;)-(-:»)• 

If T is a diagonal matrix, the inverse is also a diagonal matrix whose 
non-zero elements are the reciprocals of those of T. 

0. . . 

722 ■• • 




l/7nn' 

(6.25) 

The inverse of the inverse of a matrix is the original matrix. To prove 
this consider 

IT" 1 = I- 

Put T = A" 1 then 

A -i (A -i r i =I 
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Premultiply this equation by A and then 

(A _1 r l =A (6.26) 

as required. 

The inverse of a product is the product of the inverses in reverse order. 
If (TA) -1 is the inverse of (TA) 

(rA)(TA) _1 = I. 
Premultiply this equation by T _1 . 

A(TA) _1 = r -1 . 
Now multiply by A -1 and 

CTA) -1 =A~ 1 r -1 (6.27) 

as required. 

Complex matrices 

The matrices that occur in quantum mechanics generally have complex 
elements. The complex conjugate of the matrix T is denoted by T* and 
has elements such that 

(T*);/ = (r),;. (6.28) 

A square matrix T whose transpose is equal to the complex conjugate 
is called Hermitian or self-adjoint, i.e. 

f=r*. (6.29) 

Clearly the elements in the principal diagonal of a Hermitian matrix 
must be real. If all the elements are real, the matrix must be symmetric. 
A square matrix T whose tranpose is equal to the complex conjugate 
of the inverse is called a unitary matrix, i.e. 

f = (r -1 )*. (6.30) 

A real unitary matrix is called an orthogonal matrix. 

Eigenvalues 

Let A be a square matrix with n rows and c a column vector with n 
elements. In general the product Ac will produce another column vector. 
However in some special cases this resultant column vector is simply a 
multiple of c. 

Ac = ac. (6.31) 

This is a matrix eigenvalue equation. The null vector c = is a solution 
for all finite a. A value for a for which c & is called an eigenvalue of the 
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matrix A and c is the corresponding eigenvector. Equation (6.31) 

represents the n equations 

a n ci +a 12 c 2 + ...+a 1 „c„=ac 1 

««lCi +<* n 2< ; 2 + • • ■ +a nnC n = ac n 
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i.e. 



(6.32) 



(a n - a)ci + a 12 c 2 +...+ <*i n c n ~ 



a fll C, + a n2 C 2 + - • • + (<*nn - a )Cn = 0. ( 6 - 33 ) 

This system of n homogeneous linear equations has a non-trivial solution 
if and only if the characteristic determinant is zero. 

0(1 



det(A-aI) = 



a n -a 



«m 



a 12 



<*«2 



l m 



a„ 



= 0. 



(6.34) 



This determinant can be expanded to give a polynomial of nth degree in 
a: the characteristic polynomial of A. The n roots of this polynomial are 
the eigenvalues and may be real or complex. For every eigenvalue, the 
equations (6.32) have a solution giving the corresponding eigenvectors. 
However it should be observed that each eigenvector can be multiplied by 
an arbitrary constant and still remain an eigenvector. Consequently it is 
always possible to define normalized eigenvectors that satisfy 

c*c = 1. ( 6 - 35 ) 

As an illustration consider the two by two matrix 

M: i)- 

The eigenvectors of A are the solutions of the simultaneous equations 
-OCy + c 2 = 
c x -ac 2 =0. 
These equations have a non-trivial solution only when 



= a 2 -l=0 



giving 



a = ±l. 
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The corresponding normalized eigenvectors are 



c+ 



= V2 (l) C - = V^(-lj- 



Theorem 1 

The eigenvalues of a Hermitian matrix are real. 

Proof 

Suppose A is a Hermitian matrix and a is an eigenvalue with non-zero 
eigenvector c. 

Ac = ac (i). 
Premultiply this equation by c* 

c*Ac = ac*c (ii). 
From (i) 

A*c* = a *c*. 
Premultiply by c 

cA*c* = a*cc*. 
As A is Hermitian 

cAc* =a*cc*. 
Transpose (using (6.20)). 

c*Ac a*S*c 
and consequently by comparison with (ii). 

a=a* (6.36) 

and the eigenvalue is real. 

Theorem 2 

The eigenvectors of a Hermitian matrix belonging to different eigenvalues 
are orthogonal. 

Proof 

Let A be a Hermitian matrix with eigenvectors c„, c„, such that 
Ac„=fl„c„ (i) Ac m =a m c m (ii). 
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Premultiplying (i) by c m 

c„Ac„ =a„c m c„. 
Also since A is Hermitian 

c*Ac„=£mA*c n =(A*c*)c„ 

~ a m c m c n 

as a m is real. 

(a n -a m )CmC„ =0 
and as a„ # a m 

c*c„=0 n*m. ( 6 - 37 > 

That is the eigenvectors c m , c„ are orthogonal. 

6.3 Matrix Mechanics 

A quantum mechanical eigenvalue problem can be stated in differential 
equation form (SchrSdinger) or in matrix form. If a is a Schrodinger 
differential operator representing an observable A , the eigenvalues a are 
given by 

<**, = «,** (6>1) 

It has been shown in section 6.1 that this can be represented as a set of 
simultaneous equations (6.5). Writing 

^ = 2cj<t>j with «y=ft,ty ( 6>2) 

;' 
then in matrix notation the equations (6.5) are 

(6.38) 




with <ty = {*fa4yd T « (639) 



and 




(6.40) 
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The orthonormal set {#,} is called the basis of the representation and 
the column vector c is the representative of the wave-function \jj. 

It is assumed that the operator spectrum is discrete and the matrices 
(6.38) have a denumerably infinite number of rows and/or columns. All 
the appropriate results given in section 6.2 for finite matrices will be 
taken over in the infinite case. The more general case when the spectrum 
is partly or wholly continuous is outside the scope of this book. The 
reader is referred to Mathematical foundations of Quantum Mechanics, 
by John von Neumann. 

Hermitian matrices in quantum mechanics 

As the operator a represents an observable it must be Hermitian 
(postulate 2). Clearly 

and the matrix A is Hermitian (6.29). It has already been shown in 
theorems one and two that such matrices possess real eigenvalues and 
that their eigenvectors belonging to different eigenvalues are orthogonal. 

The diagonal representation 

The matrix representing an observable can always be put into diagonal 
form. If the orthonormal basis used to expand the eigenfunctions of (6.1) 
is taken to be these eigenfunctions themselves, then the diagonal represen- 
tation is obtained. 

a,, = j^*a\l//dT = a/Sij. 

The matrix eigenvalue equation (6.38) becomes 

'«i . 
a 2 . 
a 3 II c 3n \=a n ( c 3n I. (6.41) 



The eigenvectors are 







c„= -I . (6.42) 
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The diagonal matrix representing the Hamiltonian operator for a 
one-dimensional harmonic oscillator is 

n o o . 

H=ftwl 0^0. 
\0 \ . 

and the eigenfunction with energy (n + i)«co is represented by the 
column vector 



Cn 




Transformation of basis functions 

An arbitrary wave function \p can be expanded in terms of a complete 
orthonormal set of basis functions which span the Hilbert space of the 
system, 

* = 2<M JV*/dr = 5i,, (6-43) 

/ 
The <fr's are analogous to unit vectors and the c/'s are the 'components of 
the vector' c. 

Of course there are different sets of basis functions apart from {<pj j. 
Suppose the orthonormal set {X/} also spans the Hilbert space. 

Clearly the members of the new set can be expanded in terms of the 
old set, i.e. 

Xi = ll>pu p i /=1,2,... (6-44) 

p 

where u P i are the expansion coefficients and form an infinite matrix u. 

Similarly 

fc = Zx,»tf i=l,2,... (6-45) 

q 

where v pi form an infinite matrix v. Substitution of (6.45) into (6.44) 
gives 

X/ = 2x< 7 2 v qp u pi 

Q P 

2 V qp Upi = 8 qi . 
P 
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In matrix language 

vu = I. (6.46) 

Similarly, substitution of (6.44) into (6.45) gives 

uv = I. 
From the definition of a matrix inverse (6.24) 



v = u 1 . 



(6.47) 
(6.48) 



As both basis sets are orthonormal, then from (6.44) 
J XiXi dr = 8„ = { 2 4>* p u* pi 2 **««/ dr 



i.e. 



Sy-2up/M p/ -. 
P 

An element of the transpose matrix u will be denoted by u pi . As 



. 5 f 



«™="/p 



then 



Si/ = 2u?pU pj . (6.49) 

p 

In matrix rotation 

I = u*u. (6.50) 

It has been shown that u has an inverse and if (6.50) is post -multiplied 
by u -1 

u"'=u*. (6.51) 

This final result shows that the matrix transforming one orthonormal 
set into another is unitary (see (6.30)). 

A change of basic functions is analogous to a rotation of co-ordinate 
axes in three-dimensional Cartesian space and as is well known such 
rotations are performed by orthogonal (real unitary) matrices. 

Transformation of a representative 

Obviously the representative of an arbitrary wave function \j/ will alter 
under a change of basis functions. Suppose 

^ = 2c/ty (6.43) 
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in terms of the old set {$,} and 

i 

in terms of the new set {x,}- The members of the new set can be 
expanded in terms of the old set and substituting (6.44) into (6.52) 

<// = 2 /* 2 <t> p u pi = Z (Zupi/i) P . (6-53) 

i p p \ I" / 

Comparison of (6.43) and (6.53) gives 

Cf = 2ujif, 7 = 1,2,... 
i 

i.e. 

c = uf. ( 6 - 54 ) 

The old representative c is equal to the transformation matrix times the 
new representative f. 

Transformation of an operator matrix 

The infinite square matrix representing an operator a must also alter 
under a change of basis. A* 1 ) will be used to represent the matrix with 
respect to the set {ft) and A< 2 > for the matrix in the {x/} representation. 
From (6.4) 

<#) = j £<#, dr, <4 2) = J x?«X/ dr. (6.55) 

Using (6.44) 

J p 1 

= 22 14^0$ = 2 2 «,> 0$ U qj 
p q PI 

i.e. 

A (2) = u * A ( 1 >u. 

As the transformation matrix is unitary 

A (2) =u -i A (i) u (6.56) 

This is a similarity transformation. 

Theorem 3 

A Hermitian matrix remains Hermitian under a unitary transformation. 
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Proof 

Suppose the Hermitian matrix A^) is transformed into A< 2 ) by the 
unitary transformation u. 

A (2) = U -1 A (D U . 

Transposing 

A (2) = 5A (1)U- 1 . 

As the matrix u is unitary and A^ 1 ' is Hermitian 
A 72) =(u -i A (D u) * =A *(2) 

and so A* 2 ) is also Hermitian. 

Theorem 4 

The eigenvalues of a Hermitian matrix remain invariant under a similarity 
transformation. 

Proof 

Suppose the operator a is represented by the matrix A^ 1 ^ in the original 
representation. The eigenvalue a satisfies 

A (1) c=ac (6.57) 

where c is the representative of the eigenf unction. If the basis functions 
are transformed using (6.44) the operator matrix undergoes a similarity 
transformation (6.56), so that 

A (D =llA (2) u -i ( 6 . 5g) 

and the eigenvalue equation (6.57) may be written 

uA^u" 1 c = «c. 
On premultiplying by u -1 

A (2) (u- 1 c) = a(u _1 c). 
From (6.54) this becomes 

A (2) f = «f (6.59) 

where f is the eigenfunction representative in the new basis and the 
theorem is proved. 

Theorem 5 

The matrix of the product of two operators is the product of the 
separate matrices representing the operators. 
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Proof 

Suppose y and X are two quantum differential operators and that {^} is 
a complete set of orthonormal wave functions. Clearly 

Ui = I\ ii <l> i . (6.60) 

/ 
Xfi is an element of the matrix A representing X in the basis {</>,} and 

Then 

y\<l>i =2x /7 70,- 

= 2 x t i 1iij<t>i 

i i 

= IZyij\fi<l>, (6.62) 

i i 

where 

7// = J 0*70/ & 

and is a matrix element of T. 

Premultiply (6.62) by 0*, and integrate over all the space. Then 

(7X) mi = f 0m7X0,- dr = 2 T m/ X /7 . (6.63) 

J i 

and the theorem is proved. 

The matrices satisfy the same commutation relationships as the 
differential operators. In particular the matrices representing the co- 
ordinates q,- and the conjugate momentum p/ satisfy 

p/q* - q*r py = j I5 a/- ( 6 - 64 ) 

As diagonal matrices (of the same order) commute, then (6.64) 
implies that there is no representation in which p/ and q/ are simul- 
taneously of diagonal form. This of course is an expression of the 
uncertainty principle in that the operators p/ and qj do not possess a 
simultaneous set of eigenfunctions. 

Angular momentum matrices 

Simultaneous eigenfunctions can be found for the angular momentum 
operators J§? 2 and & z with eigenvalues /(/ + \)h 2 andftm/ respectively. 
These simultaneous eigenfunctions 4/(1, mf) form a complete set and an 
arbitrary wave-function can be expanded in terms of them. 

*(r) = 22 c(l, mdiKl, m t ). (6.65) 
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One choice for the basis functions is the orthonormal hydrogen type 
functions (5.41). 

ty (I, m,) = R ni (r)If l (cos 8) e""'* m, < /. (6.66) 

The matrix elements for the angular momentum operators in this 
representation are 

(^ 2 ) /m/>/ . m/ ,=a 2 /(/ + i)s„'8 m , m; , 

and 

(-^z)/m ; , I'mi' = ^""/5//'5 m/m/ .. 
The matrices are both diagonal 



(6.67) 



(6.68) 



L 2 =a' 
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(6.69) 



L 2 ="i 



/^J 
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-1 
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-1 
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-2 



(6.70) 



/ 



The matrices representing & x ,& y are not diagonal. 



The Schrodinger picture 

In Chapter 3 the basic postulates of the Schrodinger theory were 
given. The differential operators representing observables are in general 
time independent and the time-dependence of a state is included in the 
wave-function which undergoes a rotation in Hilbert space as time passes. 

The Schrodinger equation has an analogy in matrix mechanics. Let 
{0/} be a complete set of time-independent orthonormal basis functions 
which can be used to expand an arbitrary state wave-function. 



*(»,f) = 2c/(fXj(r). 



(6.71) 



MATRIX REPRESENTATIONS 143 

As *(r, t) must satisfy the Schrodinger equation 

1 i 

Premultiply this equation by 0*(r) and integrate over all space. 

2Jt v c,{t)=mct{t) 



where 



Jfi^j'fcVtydT. 



In terms of matrices 

Hc(r) = /ftc(f). (6.72) 

The time-dependence of the wave-function ^(r, r) is reflected in the 
time-dependence of the column vector. 

In the 'Schrodinger' picture where the expansion (6.71) is employed 
any time-independent differential operator is represented by a time- 
independent matrix. 

When the basis functions used in (6.71) are the energy eigenfunctions, 
the matrix H is diagonal and the time variation of the vector c is given by 

c i (t)=e- iE i t /"c i (0). 

The Heisenberg picture 

There is another important way of discussing the time variation of a 
state. This is the Heisenberg picture in which the basis functions used to 
expand an arbitrary function are time-dependent. Suppose */(r, a™ 1 
$/(r, f) are solutions of the system Schrodinger equation corresponding 
to the initial states $,(r, 0) and */(r, 0) respectively. The Hermitian 
character of the Hamiltonian Jf determines that the 'inner product' of 
the two functions remains constant in time. 



^ Ml <^\[f *,**?% 



dr = 0. 



In particular if the complete set {$,} is orthonormal at f = then it will 
remain so for all time if 

3e^j = m^P for all/. (6.73) 

; dt 

Since the set is complete it can be used to expand an arbitrary 
solution of the Schrodinger equation (6.73) 

*(r,0 = 2c / * / (r,r) (6.74) 
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Since the set is orthonormal for all time 

cy = j*;(r,0*(t,f)*- 

and from (6.73) the coefficients cj are constants. 

The column vector representing the wave-function is now time- 
independent and the time variation is included in the operator matrix. 

Suppose a is a Schrodinger differential operator representing the 
observable A. The elements of the operator matrix A are given by 



«*/ = /**(', 0«*/(r,f) dr. 



(6.75) 



From Chapter 5 (equation (5.47) and question 4) the elements of A 
are given by 



and in matrix notation 



9a ' ,^, 



*/(r, i) dr (6.76) 



: 3A i* 

A = — + -(HA-AH) (6.77) 

where the elements of the matrix 3 A/9? are 

If the differential operator is time-independent then Heisenberg's 
equation (6.77) becomes 



A = - (HA - AH). 

n 



(6.78) 



As an example consider the matrices q,- and p,- representing the fth 
components of position and momentum for a free particle. From (6.64) 



<L.= l(H<,-q,H) = ^ 



Pi=-(H Pi - Pi H) = 



since 



1 = 1,2,3. 



H = — (p?+p|+pi). 
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A useful representation is obtained when the basis functions are the 
energy eigenfunctions so diagonalizing the Hamiltonian matrix. Then 
the elements of the time derivative matrix A are (from (6.78)) 

where E k and £/ are energy eigenvalues. Integration of (6.79) gives 
'.(E k -Ej) 



ajt/(0 = a*/(0)exp 



(6.80) 



In this representation, off-diagonal elements oscillate in time with 
frequencies co*/ = (E k — Ej)/h. 

The density matrix 

The quantum mechanical theory so far developed deals with 'pure states' 
for which the state function is completely known. Even in this case the 
value of an observable is not well defined unless the state function is an 
eigenfunction of the observable operator. It is often the case that the 
state-function is not known completely and then additional uncertainties 
must appear. There may be many wave-functions compatible with the 
incomplete information known about the system and the effects of these 
must be suitably averaged. This second statistical aspect, due to the fact 
that not all the information that can be known is known, is dealt with 
by statistical mechanics and occurs in both quantum and classical 
mechanics. 

Such a quantum-mechanical system which is not known as completely 
as is possible, is said to be in a 'mixed state'. It is useful to introduce the 
concept of a representative ensemble. This is a collection of similar 
systems each described by different wave-functions but with the property 
that the values of the known observables are the same in all systems. The 
values of the remaining observables (of a complete set) form a suitable 
chosen distribution. 

Consider a representative ensemble composed of N systems described 
by the normalized wave-functions ^j, ^ 2 » • • •» *at- Let {<£>/} be a 
complete orthonormal set which will be assumed to be discrete. Then 

*t = 2c fl *j with c /l - = f*** i dT (6.81) 

/ 

A 'density operator' p is defined by specifying its matrix elements in 
this {$ ; } representation. 

>*/-J72 c J c «- (6 - 82) 
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It can be shown that the density matrix p is Hermitian and transforms 
like any other operator (6.56). 

In the special case that the wave-function is known completely (*/ 
say), the density matrix takes the form 

Pkf=C*iC kl . 

The square of this matrix is given by 
(P 2 ) k f = 2 (c* i C k t)(c* i Cn) = p kj 

since the wave-function is normalized and 

2c,*c H =l. 
i 

The density matrix is equal to its own square when the ensemble is 
described by a single wave-function. 

In the general case, the density matrix contains all the physical 
information that is known about the system and can be used to find 
the mean value of any observable. Suppose A is an observable represented 
by the Schrodinger operator a. The expectation value for A for the z'th 
system in the ensemble is 

<A> = (*?a% dr = 2 2 c* ( c ki a, k (6.83) 

j k 
where 

a/*=j */<*** dr. 

The elements a, k form the matrix representing A in the representation 
defined by the set {<&/}. The mean value for this observable for the system 
represented by the ensemble is from (6.83) and (6.82) 

[A] =^ J (?2^c fc , a/fc | = Trace (pa). (6.84) 

This equation is valid in all representations. 

In classical mechanics the observable A is a function of the co-ordinates 
and momenta. The classical average for this observable for the ensemble is 

[A]=jf(p,q,t)A(p,q)dpdq 

where the integration is over phase space and f(p, q, t) is a suitably 
chosen distribution function. The density matrix in quantum mechanics 
is analogous to the classical distribution function. 

In the Schrodinger picture the expansion coefficients are in general 
time-dependent. A simple expression can readily be obtained for p. The 
expansion coefficients satisfy (6.72). 

^.J^ kj Cji=ihc ki . 
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From this equation together with (6.82) and remembering that Jf is 
Hermitian it is easily shown that 

ih'pki = 2 [tfkjPji - PkjtfjA 
i 

i.e. 

p = _l(Hp-pH). (6.85) 

Note carefully that this equation differs in sign from the equation of 
motion (6.78) of an observable in the Heisenberg picture. 

Equation (6.85) does not hold in the Heisenberg picture as the time 
dependence is then in the matrices representing the observable and in this 
case p is a constant matrix. 

For a more complete discussion of the density matrix the reader is 
referred to the review paper Theory and Application of the Density 
Matrix by D. Ter Haar. 



PROBLEMS 

1 Explain how any real square matrix can be written as the sum of a 
symmetric matrix and a skew-symmetric matrix. 

2 Let A be a square matrix with n rows and let c,- be an eigenvector 
with eigenvalue d,- so that 

(A-d,I)c,=0. 

An arbitrary column vector x with n rows can be expressed as a sum of 

the n eigenvectors, i.e. 

n 

X= 2 «|C,-- 
i=J 

Show that A satisfies the Hamilton-Cayley equation 

(A-diI)(A-d 2 I)...(A-d„I) = 0. 

(Note that is the characteristic polynomial of (6.34).) 

3 Prove that the eigenvalues of a skew-symmetry matrix are purely 
imaginary or zero. 

4 Prove that eigenvalues of a unitary matrix have the absolute value 
one and so lie on the unit circle in the complex plane. 

5 Prove that if the matrix Thas eigenvalues dj, d 2 , . . . then the 
matrix T 2 has eigenvalues i\, df, . . . etc. 
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6 Consider the three matrices 

l\ (o -i\ (\ 



Tl= \i o] x >-\i o) T3 = lo -1 

Show that 

(a)T?=T 2 2 =Ti=l 
and 

(b) TjT 2 -T 2 Tj =2/t 3 . 

Show also that Tj,T2 andT 3 have the common eigenvalues ±1 . (These 
are the Pauli spin matrices.) 
Construct the matrix 

|(I^ 3 ) 

and confirm that it has eigenvalues +e and zero. (This result is used in 
the isotopic spin formalism to distinguish between protons and neutrons.) 

7 Verify that the eigenfunction ip„ = y/l sin (rntx) are orthonormal 
over the range < x < 1 and satisfy the Schrodinger equation for a free 
particle in the one-dimensional box <x < 1 . 

Construct the matrix representing the linear momentum 

h d 



i ax 






obtaining 






Pnm = -i4hnm/(n 2 


-m 2 ) 


(n ± m) odd 


= 




(n ± m) even. 


Confirm that the matrix p 


is Hermitian. Use the relation 


H = ^p> 

2m F 







to obtain the elements 3f n and jt 22 of the diagonal matrix H and show 
that 



^ 12 =0. 



8h 2 v 4k 2 
m 2* (4A: 2 - l) 2 


h 2 n 2 
2m 
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8 In the text it was argued that as the quantum mechanical differential 
operators are Hermitian then the representative matrices must also be 
Hermitian. 

This final result may be obtained without reference to the Hermitian 
character of the operators. 

Clearly, any physical observable must have a representation in which 
it is represented by a diagonal matrix with real elements. Using the 
result of theorem three show that an observable matrix is Hermitian in 
all representations. 

9 The adjoint of a matrix T is denoted by T + and is the transpose of the 
complex conjugate, i.e. 

r + =f*. 

Show that a Hermitian matrix is equal to its adjoint (Self-adjoint) and 
that the inverse of a unitary matrix is equal to its adjoint. 

10 Suppose the Hamiltonian operator J^for a system can be written 
Let the orthonormal set {^„} satisfy the equation 

A wave-function ^ satisfying the Schrodinger equation for the total 
Hamiltonian Jf can be expanded as 

* = 2c„(0*«(r,0. 
n 

(Observe that as the functions *(r, t) and *„(r, f) satisfy different 
Schrodinger equations the expansion coefficients are functions of time. 
Compare equation (6.74) for the Heisenberg picture.) 
Show that 



.*" 2 *„*„=« 5^*„ 



V 3 ' 

and derive the matrix equation He = iflc. (This representation is called 
the interaction representation. 3^ represents two non-interacting 
systems and 3tf" represents their mutual interaction. It reduces to the 
Heisenberg picture when H' = 0.) 

Show that the equation of motion of a matrix T representing a time 
independent operator is 

r=^-(H r-rH ). 

Compare equation (6.78). 
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1 1 Using equation (6.23) show that the trace of a matrix is invariant 
under a similarity transformation. Hence deduce that the trace is equal 
to the sum of the eigenvalues. 

12 Consider the definition of the density matrix (6.82). Show that 
under a change of basis functions (6.52-6.54) the density matrix 
transforms like an operator (6.56), i.e. 

p(2) = u -l p (l) u . 

Hence show that 

Trace (p (2) a (2) ) = Trace (p (1) a (1) ). 

confirming that the mean value [A] (6.84) is unchanged by the 
transformation. 

13 It has been shown in the text that the density matrix of a pure state 
isidempotent.i.e. 

P 2 =f>. 

This equation can be written 

p(p-I) = 0. 

Explain why the eigenvalues of p are zero and one. 
Show that 

Trace p = 1. 

Hence deduce that the eigenvalue one is non -degenerate. 

14 In Chapter 4 it was shown that the eigenfunctions of the one- 
dimensional harmonic oscillator are the Hermite functions 

* n (y) = e-y 2 ' 2 H n (y) (4.30) 

where H„(y) is a Hermite polynomial. 

The Hermite functions are orthogonal and satisfy 



Given the recurrence relations 

2«i//„_! =y$ n +K 
2>^„=2n</>„_i +^ n +i 
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show that 

f° 

j ^ m y^ndy = l2 n (n + l)\y/H 
^2" _1 /i!-sA 
and 

f ° 

[ 2"- l n\sft 

Multiply 4> n (y) by a suitable constant to normalize the wave-functions 
\p(x) (solution of (4.27)) to unity and confirm the following matrix 
representations for the position co-ordinate and momentum. 

° ° ° 

y/1 y/2 

y/2 



m¥=n ± 1 
m-n + 1 
m = n - 1 

m=£n± 1 
m=n + l 
m-n — \. 



x = 



/2com 



-J 



#C*>/M 



■ o V 5 " ° ° °- 

-y/2 

ov 5 ° -V 2 ° 



Using the relation 

1 , 1 
H= — p 2 +- 

2m 2 



cj 2 m\ 2 



show that H is diagonal. 
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CHAPTER 7 



Approximate Methods 
of Solution 



7.1 Introduction 

Most problems in quantum mechanics do not possess exact analytic 
solutions. Some do of course, and a few of these have been mentioned 
in Chapter 4. However, approximate methods of solution play an import- 
ant part in the theory and the present chapter, together with Chapter 8, 
will be devoted to a discussion of them. This chapter will deal with 
stationary state problems where the aim is to evaluate the energies and 
eigenfunctions of some time-independent Hamiltonian operator. 

Three methods will be discussed, the variational method, the method 
of linear combinations including the perturbation method and the 
Wentzel-Kramers-Brillouin-Jeffreys (W.K.B.J.) approximation. The 
relationship between the variational and perturbation methods will be 
emphazised. (See also Slater.) 



7.2 The Variational Method 

Theorem 1. The Rayleigh-Ritz variational principle 

Suppose i/> is a wave-function that is not necessarily normalized. Then the 
quotient 

<J^> = j\lj*J^\pdTl^*\pdT (7.1) 

is an upper bound to the least eigenvalue of the quantum system 
described by the time-independent Hamiltonian Jf. 
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Proof 

As the Hamiltonian is time-independent it has a complete set of ortho- 
normal eigenf unctions {$„} such that 

An arbitrary wave-function can be expanded in terms of this set. 
$ - 2 c n <t>„, c„ constant. (7.3) 

Consider the integral 

( «//*3f V dr = f 2c*<j>* & Ic m <l> m dr 

n m 

= 2e„\c„\ 2 . 

n 
\iE x is the ground state of the system then E x <*E n and clearly 

\^*jf^dT>E l 2\c„\ 2 . (7.4) 

J n 

But 



n 
and so 



2k„l 2 = (V\Mr 



I 



Tp*jr\l> dr 

>E X (7.5) 

I \j/*yp d T 

as required. The equality applies when t// is the exact wave-function. This 
is a most important result. In practice an informed guess is made for the 
ground state wave-function which contains some parameters a 1 ,a 2 , . . . 
say. The quotient (7.1) is evaluated and the parameters given the values 
that satisfy 

— — =0 i = l,2,... (7.6) 

so that the expectation value <Jt> is minimized. In general the energy 
eigenvalue obtained is considerably more accurate than the wave-function. 

The method can be extended to find upper bounds to the excited 
states. Suppose it is desired to find an approximation to the first excited 
level E 2 . The choice for ty is then of the form 

*=X-*i JViXdr (7.7) 

where x contains the free parameters. In this case \p is orthogonal to the 
normalized determined ground state wave-function \p i . If 4/ 1 is a good 
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approximation to the true ground state function <j>i then the expansion 
for \p (analogous to (7.3)) will exclude <j> u i.e. 

4> = 2 c„<p„. (7.8) 

It is left to the reader to verify that 

<jr»E 2 . (7.9) 

Theorem 2. The virial theorem 

The virial theorem states that when a system is in a stationary state for 
which the potential is a homogeneous function of the co-ordinates of 
degree s the expectation value of the potential energy is 

<K> = -<7> (7.10) 

s 

where (T) is the average kinetic energy. This theorem holds in both 
quantum and classical mechanics. 

Proof 

Consider a quantum mechanical system composed of n particles of 
masses m t . If they are at positions r,- the Hamiltonian operator is 



jf^-y^-Vf + V(r u ...,r n ) (7.11) 

t—i 2m: 
1=1 

where Vf operates on the co-ordinates of the ith particle only and 
V(r x , . . ., r„) is the potential energy of the system. Since V is homo- 
geneous 

n«i ar„) = a*K(ri *")• C 7 - 12 ) 

Let xO"i r„) be an arbitrary function and define the quotients 

fi 2 



A = 



B = 



Jx*('i, ■ • ->r„) 2 2^ V,?x(ri ' • • ■' In)dT 
Jx*(ri,...,r„)x(r 1 ,...,r„)dr 
J x*(r lf ■ • ., r„)V(t u . . ., r„)x(ri r„) dr 



Jx*(ri.---»r„)x(ri»-.-,r„)dT 



the integrations being over all values of the co-ordinates. 
Consider now the trial wave function 

V/(r 1 ,...,r„) = x(ar 1 ,...,ar„) (7.13) 
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where a is the variational parameter. The expectation values of the 
kinetic and potential energies are respectively 

(T) = a 2 A and (V) = a~ s B. 

(To show this carry out the transformation r,' = ar,-, i = 1 ,...,« and 
note that the Jacobian is the same constant for all integrals and cancels.) 
The expectation value of the Hamiltonian is 

(je> = a 2 A+a~ s B 

and the best choice for a is that which satisfies 



or 



3a 

2 



= 0=2aA- soT*-^ 



-(a 2 A) = (a~ s B) (7-14) 



then 



(V) = -<T). 
s 

In the case that the particles have a charge e t respectively and interact 
through a Coulombic potential 

** *>-ils^T <715) 

and s = — 1 . Then 

(V) = -2(T)=+2(Jtf). 

This result must hold in the case that \j/ is an exact eigenfunction of 
the Hamiltonian with energy E and then 

E = -{T) = \(V). C 7 - 16 ) 

Note that the kinetic energy has a maximum value in the lowest energy 
state. 

Ground state of the helium atom 

The Hamiltonian operator for the helium atom may be written 

* 2m (Vl+V2) 47reor! 47re r 2 4ireolr 1 -r 2 | 

(7.17) 

where the two electrons are at r, and r 2 respectively. The kinetic energy 
of the nucleus and the spin-interaction terms are ignored. 
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If the electron-electron interaction term e 2 /(4ne I rj — r 2 1) is ignored, 
the Schrodinger equation is separable and the helium wave-function is 
simply a product of two ground-state hydrogen type wave-functions 
(see (4.73)), i.e. 



w \ a o/ 



exp 



— (ri +r 2 ) 



(7.18) 



In fact the two electrons will screen each other from the nucleus and 
so a reasonable simple trial function is to write 



^('l.'2) : 



]_(a_ 
■n \a 



exp 



- — (ri +r 2 ) 



(7.19) 



where a is a parameter. This wave-function is already normalized. 
The expectation value of the kinetic energy is clearly 

<r> = 2<7V> (7.20) 

where < 7#> is the expectation value of the hydrogen type atom (Z = a) 
in its ground state. From the virial theorem < 7#> is equal to the binding 
energy of the hydrogen type atom in its ground state and so from (4.71) 



<r> 






1 



J(4ne ) 2 



2 2 

a e 



a (47re ) 
Similarly, the expectation value of the potential energy is 

,2 



4 / e- 

a \47re 1 r t - r 2 



(7.21) 



(7.22) 



where < K#> is the potential energy of the hydrogen type atom (z = a) 
and from the virial theorem 



<V H ) = 



— a 2 e 2 
a (4ne ) 



(7.23) 



The remaining term in the potential energy (7.22) represents the 
interaction of two superimposed spherical symmetric charge distributions 
and it can be shown that 



\47re |ri -r 2 |/ 



5 



8 a (47re„)' 



(7.24) 
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From equations (7 .2 1 M? .24) the expectation value of the 
Hamiltonian is 



{3t) 



= (a 2 -4a+— al- 
\ 8 /' 



e2 (7.25) 



8 / 47re a 
The best value for the parameter is obtained from 

3 «r> e 2 / 2a _2Z, = 



9a 47re a \ 8 

i.e. 

a-tf. < 7 - 26) 

The least upper bound for the ground state energy that can be 
obtained from the trial function (7.19) is 



<JT> 



in. 

V 16 



27 \ 2 e 2 e 1 
11 I — ^_2-848- 

J 47re flo 47re flo 



^-2-848— . (7-27) 



The corresponding binding energy, that is, the energy required to remove 
both of the electrons, is +2-848 e 2 /47re a - The experimental value is 
+2-904 e 2 l4ire a . 



7 .3 The Method of Linear Combinations 

As has already been mentioned in the introduction, this chapter deals 
with stationary perturbation theory in which the Hamiltonian is time- 
independent. The variation principle may be used to derive the appro- 
priate expansions. 

The object is to find the eigensolutions of some given Hamiltonian 

jf. That is, to solve 

XV-**. (7 - 28) 

Let {x/} be a set of « linearly-independent functions which for the 
moment will not be assumed to be orthogonal. Observe that this is a 
finite set of functions and so is not complete. Suppose this set is used to 
give an approximate expansion for an eigenfunction of Jt, i.e. 
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The wave-function (7.29) may be used in a variation problem where the 
expansion coefficients are regarded as parameters. The expectation value 
for the Hamiltonian (7.1) is 



<jf> = 



f 2 cTxtsri. war 

J i = 1 i = l 

/• n n 

J 2 c* x* 2 CjXj dr 
J i=i /=i 

2 2 cfMjCj 

' = 1 7=1 

n n 

2 2 cts ilCj 

1=1 /=! 



(7.30) 



where 



and 



5„ = Jx*X/dr. (7.31) 



In matrix notation 

c*Hc 

<Jf> = 4^. (7.32) 

c*Sc 

H and S are both Hermitian and c is a column matrix. A very simple 
argument shows that this expectation value is real. As a scalar quantity 
is equal to its own transpose 

cHc* 
cSc* 

However as H and S are both Hermitian then 

cH*c* 

cS*c*' 

This final result shows that < 3#*) is equal to its own complex conjugate 
and so is real. 

The best trial solution of the type (7.29) is that which satisfies 

^pP-«0 Z-l,2,...,« (7.33) 

aci 
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2, JCljCj — „ n 
;=1 V V .* 



2 % cf^ijCj „ 

2 s H =o 



2 2 c*S lf Cj i 
i i 



1 = 1,2, ...,n 
(7.34) 



Writing E A for the real approximate value of the energy, these n 
equations are in matrix form 

Hc-^Sc^. (735) 

If Of) is differentiated with respect to c t instead of c* an equivalent set 
of equations is obtained. 

The equation (7.35) can be obtained directly by substituting (7.29) 
in (7 .28), integrating and replacing E by E A . The condition for the set 
of n equation (7.35) to have a non-trivial solution is that the characteristic 
determinant be zero, i.e. 



det(H-E A S) = 0. 



(7.36) 



The determinant (7.36) is a polynomial of degree n in E A and the lowest 
root is the best approximation to the ground state of the system. 

Hydrogen-molecular ion H% 

This is the simplest molecular structure and may be approximated by a 
one-electron problem, the two nuclei being in fixed positions. The 
electron Hamiltonian is 



h 2 _j 1 (e 2 e 2 

3tf= V 2 — + — 

2ot 47re \r 1 r 2 



4ne R 



where r u r 2 are the distances of the electron from the protons which are 
a distance R apart. This problem can be solved exactly in elliptic co- 
ordinates and the energy found for different values of/?. The minimum 
energy occurs when R = 1 -06 A in agreement with experiment. The 
corresponding binding energy is 2-8 eV. 




+ e R 

Figure 7.1 The hydrogen molecular ion. 
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However, as an illustration, this problem is solved by expressing the 
molecular orbital as a linear combination of atomic orbitals. The simplest 
approximation to the ground state is 

where \p u ty 2 are the real normalized hydrogen (Is) wave functions 
centred on the two protons and Cj, c 2 are constants. 

The best solution of this type is obtained by solving the characteristic 
determinant 



■^11 _ E A ^C x2 - E A S l2 

<^2i - E A ^21 ^"22 - E A 
where 

Sv. =S 2 i = J ^1^2 dr 
as \p i and i// 2 are rea l functions, and also 

^12=^21=f>/'l^' , /'2dT 

■*ii =J#?i=\i>^i>\ dr. 
The two roots of the determinant are 



= 



E A = 



1-S 



12 



As \pi and i// 2 are both positive in the region of overlap then it can 
be shown that «?f 12 < and so the lowest root is 



„ Jt\\ f Jin 
Eai ~ 1+5 ' 



12 

This corresponds to a binding energy of ~ 1 -8 eV which involves an error 
of 1 eV. 

The values of c x and c 2 are obtained by solving the two equations 

(^11 -Ea) Ci +{Je l2 -E A S 12 )c 2 = 

(Jt 2l -E A S 21 ) Cl +(JT 22 -E A )c 2 =0. 
By putting E A =E A1 it can readily be seen that 

C\ =c 2 
and so the normalized ground state is 



*g=Wi+<p2)/>/2(i+s 12 ). 
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This is an even function, (gerade or^-state) and represents a bonding 
state in which charge is transferred to the bond, i.e. 

The other root of the determinant corresponds to the odd function 
(ungerade or « -state), i.e. 



*«=(*! -*2)/V2(l-5 12 ) 

and represents an antibonding state in which charge is removed from the 
bond. 



Theorem 3 

Let E r be the rth root of (7.36) and Crj.j = 1 ,...,«, the corresponding 
set of expansion coefficients. The n approximate eigenfunctions are 

</V= IcrjXj r=l,2 n. (7.37) 

These approximate eigenfunctions are orthogonal. 



Proof 

Let E q and E r be two different approximate eigenvalues with corresponding 
eigenfunctions \J/ q , i|/ r 

\p*4> r At = 2 2 c*,^ I x*Xf dr = 2 2 c*-5,/ c r/ 
J i = 1 / = 1 J i /' 

= c|Sc r . (7.38) 

From (7.35) 

He, = E q Sc q (a) Hc r =£VSc,. (b). 
Transpose (a) and take the complete conjugate 

c*H* = E q c*S*. 
As the matrices H and S are Hermitian 

c*H=E q c*S. 
Post multiply this equation by c r and premultiply (b) by c, 

c*Hc r = E q c*Sc r c*Hcr=E r c*Sc r (7.39) 

On subtraction these equations give 

(E q - E r )c*Sc r = 0. 
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If the eigenvalues are not degenerate, this equation implies 

c^Sc r =0. 

From (7.38), the approximate eigenfunctions are orthogonal. If these 
functions are normalized then 

JVjlMr = 8, r . (7.40) 

Discussion 

The linearly-independent functions fa/}. ;' = 1, . . ., « define an n- 
dimensional sub -space of the system function space. In the Rayleigh-Ritz 
variational method the best approximation to the ground state is 
obtained by minimizing the quotient (7.1). The best approximation to 
the rth excited state is obtained by minimizing (7.1) subject to the 
condition that the trial function is orthogonal to the (r — 1) approximate 
eigenfunctions for the lower eigenvalues. The results of the Rayleigh-Ritz 
method are upper bounds to the true eigenvalues. 

Calculated levels 



Energy 



n = 3- 



True ground state 
Figure 7.2 

The method of linear combination is essentially the Rayleigh-Ritz 
method with the trial functions confined to the n-dimensional sub-space 
{x>}. The n roots of the characteristic determinant 

D„(E A )= det (H-E A S) = (7.41) 

give the corresponding best possible approximations to the lowest eigen- 
values. Of course the approximation is best for the deepest levels. 
Consider now the determinant D n + 1 (E A ) constructed from the 
(n + 1) independent functions Xi, ■ • •, Xn> Xn + i- This problem is 
equivalent to Rayleigh-Ritz minimization method with the trial 
function i// restricted to the (n + 1 )-dimensional sub-space spanned by 
{X/},/=l n + l.i.e. 

* = "l c, Xt . (7-42) 

/ = 1 
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The greater flexibility of the form of ^ implies that the roots of 
D n+ i(E A ) will be less than or equal to the corresponding roots of 
D n (E A ). If the rth roots are written E^ n + 1 ^ and £•<">, the superscript 
denoting the dimensionality of the associated function space, then 

£V<" + 1 > <£•(">. (7-43) 

The roots of the new determinant D n+l (E A ) give a better approximation 
to the previous n eigenvalues as well as introducing an approximation to 
another eigenvalue. 



Perturbation of non-degenerate levels 

The method of linear combinations can be used to obtain the well-known 
perturbation expansions. Consider the case where the Hamiltonian can 
be written 

3^ = 3^° + A (7.44) 

where the energy associated with Jf° is large compared with that 
associated with A. JP° must be chosen so that its eigenvalues and eigen- 
functions are known. Let the functions {x,-} be these eigenfunctions 

It is convenient to consider the Hamiltonian 

jf?~jr°+\A 0<X<1. (7-45) 

The parametric value X = 1 corresponds to 'complete' perturbation and 
X = to zero perturbation. The Hamiltonian (7.45) is solved and then X 
put equal to unity in the results to obtain the required solution of (7.44). 
The unperturbed Hamiltonian eigenfunctions {x/} are orthogonal and 
may be chosen to be normalized. Following (7.29), the first n of these 
eigenfunctions, corresponding to the n lowest energies, are used as an 
expansion set. The overlap integrals (7.31) are 

Si, ~ 5,-/ 
and the elements of the Hamiltonian matrix are 

je^E^a+Xhij 
where 

hi,=jx*AXi^r. 

The n equation (7.35) become 

t (E^tj+Xh,) - E^^Cj = / = 1, 2, . . .,n. (7.46) 
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In the unperturbed case X = 0, the determinant of (7.46) is diagonal 
and 

Y\(E?-E A ) = (7.47) 

giving the n solutions 

E A =ElEl...,E« 

as expected. 

The basic assumption of perturbation theory is that it is possible to 
expand the perturbed energies and eigenfunctions as power series in X. 
There are some cases when this assumption is invalid. For example if 
jfo has a discrete spectrum and «?f has a continuous spectrum. For the 
ground state, the unperturbed eigenvalue and eigenfunction are E® and 
Xi, respectively. The perturbed ground state energy is written 

E 1A =E? + \E{» + X 2 £-{ 2) + . . . (7.48) 

and the perturbed ground state wave function 

^,=Xi+X^+X 2 ^ 2) + .... (7-49) 

e{ 1) is of the same order as A and E{ 2) is of the second order of smallness 
(The reader is warned not to confuse E^ etc. with the entirely different 
meaning given to E{"^ in (7.43).) 

With the energy expansion (7.48), the characteristic determinant of 
the equations (7.46) becomes 

Xh n - \EP - X 2 E[ 2 \ Xh n , . . . Xh ln 

Xh lu Xh 22 + £- 2 ° - £? - X£f > - X 2 E[ 2 \ . . . 



Xhn1 ' , r ^ 

(7.50) 

This determinant is a polynomial in X of degree n and is identically 
zero for < X < 1 . The coefficient of each power of X must vanish. 
The term in X, corresponding to the first order in smallness, is the 
diagonal determinant 



XQiu-EP), 0, 0,...,0 

0, El -El 

E°,-E°, 



E2-E? 



= 
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i.e. 

(h n - E{»yE$ - E?) . . . (ES - Ef) = 0. (7.51) 

If the ground state eigenvalue E® of the unperturbed Hamiltonian is 
non-degenerate 

E[ l) =h n =\x**Xi*T. (7.52) 

This is the first-order perturbation of the ground state energy. By taking 

E jA = E t ° + \E,M + \*Ef 2) + ... (7.53) 

it can similarly be shown that the first order perturbation of the/th 
state, (assumed non-degenerate), is 



E^=h jr \xUXi^ 



and so 

E jA =E?+\j X *d X jdT. (7.54) 

For complete perturbation X = 1 and so to first order, the energy of the 
perturbed/th state is 

(Strictly Eja is a power series in X and ought to be investigated for 
convergence to decide if it is valid to put X = 1.) The energy (7.55) is 
the expectation value of the Hamiltonian (7.44) for the unperturbed 
wave function x/. 

The first-order change in the ground state wave function is expanded 
in terms of the first n functions of the set {x/} so that 

*p>- icp Xi 

and 

*i =0 +M°)xi +x iU (1) x/- 

When E 1A is given its value (7.54), the n equations (7.46) are, keeping 
first-order terms only, 

h n +(E?-E«)cP = l = 2,3,...,n 

C ' (1> = E\ h -E? / = 2,3,...,«. 

To first order in X, the first of the equations (7.46) is the trivial identity 
zero equals zero and so c^ is not obtained by this method. The value of 
c} 1 ' is chosen so that 1^1 is normalized and by considering first-order 
terms it is readily shown that the choice c^ - is satisfactory. 



166 



A FIRST COURSE IN QUANTUM MECHANICS 



So, the coefficients in the expansion (7.29), for the ground state wave 
function, are to first order 



C =1 



cf 



,( 1 >=- 



h n 



E? 



E? 



1 = 2, 3,. ..,n. 



(7.56) 



By an exactly similar argument the coefficients for the /th state wave 
function are, to first order 



c, = l 



cP = 



hij 



*?■ 



■E? 



i*i 



(7.57) 



and for complete perturbation with \ = 1 , the /th perturbed wave 
function is 



*/=x/+ 2 



»// 



>*i 



E,° 



E pXi- 



(7.58) 



The second-order changes due to the perturbation can also be calcu- 
lated. To find the second-order change in the energy of the ground state 
the determinant (7.50) is expanded by the first row keeping only those 
terms in \ 2 . Then 

\ 2 {-E{ 2 \E 2 ° - E?)(E$ - if?) . . . (ftf - if?) 

- \h n \ 2 {E% - * ?) • . . (2# - *?) - I h a \ 2 (E 2 - E )(E 4 - E?) 

. . . (ES - *?) . ■ • - 1 h t n \ 2 (E? - E?) . . . (E°_, - E?)} = 0. 

This is equivalent to the bordered diagonal determinant 

-\ 2 E{ 2) Xh 12 Xh 13 ... \h ln 

\h 2l Eg-E? ... 



Xh 3l E§ - £? 



\h 



nl 







E°-E? 



= 0. 



On the assumption that the ground state energy is non-degenerate, 
the second-order change in the energy of the ground state is 



^ 2) =-2 



Mil 



1=2 



E?-Ef 



This result may be obtained directly by substituting (7.56) into the 
first of the equations (7.46) and picking the second-order terms. Observe 
that the second-order change in the energy of the ground state is always 
negative as E? > if?. 
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By a similar argument, the second-order change of the energy of the 
fth state, (assumed non-degenerate), is 

\h„\ 2 



^>=-2 



E-0 _c-.o- 

/#/ £ ' £ / 

So to second order, the perturbed energy of the/'th state with X = 1 is 
E,a =EJ> + h,j- J -Jfi^o- (7.59) 

14- j ' ; 

Note that if the wave function is known to first order (7.58), enough 
information is known to calculate the energy to second order. 

If the number of basis functions is increased the calculated value will 
be improved. In the case that the complete set of eigenfunctions of 3f?° 
is used, the summations (7.58) and (7.59) include an infinite number of 
terms. These are obtained directly in the more usual method of obtaining 
the perturbation expansions but this does not show the relationship 
between the variation and perturbation methods. 

The perturbation method is applicable when although the true 
Schrodinger equation cannot be solved, solutions can be found for a 
derived equation, which differs from the true one by terms that are 
small. Often this cannot be done or perhaps the convergence is poor. 
Then the variation method is often useful. 

Van der Waals interaction 

The attractive forces that hold most atoms together to form molecules 
are very short range and can be ignored for atomic separations greater 
than a few angstr5m units. Even so there are weak attractive forces that 
act between neutral atoms at separations large compared with the Bohr 
radius. These are the van der Waals forces and they vary as 1/R 6 where 
R is the atomic separation. They are due to a mutual polarization of the 
two atoms and help to determine the properties of gases. 

Consider two hydrogen atoms at a fixed sitance R apart. This is a 
two-electron problem and the Hamiltonian is 

h 2 



2m 



,„ 2 rt2 , 1 ( e 2 e 2 e 2 e 2 \ 

(Vi+Vl)--; + — + + 

4™?o \r la r lb r 2a r 2b j 



e 2 e 2 

+- 



4ne ri2 4rre /? 

where r la , r lb are the distances of the first electron from the first and 
second protons at V and V and similarly r 2a , r 2b a re the distances of 
the second electron from the protons. The electron separation is r 12 
(see Figs. 7.1 and 11.2). 
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The Hamiltonian 



2m 4tre \r la r 2b ) 



represents the separated hydrogen atoms and has known exact solutions. 
The 'atom-atom' interaction Hamiltonian 



A = 



47re \7-i 6 rjaj 4ire \r 12 RJ 



can be regarded as a perturbation. 

For the ground state both electrons are in the (Is) state (about 
different protons) and the wave function is 



exp^i(ri,r 2 ) = -| — ) exp 



* \a J 



- — (ria+r 2b ) 



It is convenient to choose a co-ordinate system with the z-axis along the 
line of atoms and an expansion of A in powers of \\R, keeping the first 
term only, gives 



e 2 



47TC R*( XlaX2b + yiay2b -2zi a Z2&) 

where (x la ,y ia , z la ) and (x 26 , y 2b , z 2b ) are the co-ordinates of each 
electron relative to their respective protons. This can be written 

3(Pi-R)(p 2 -R)\ 



6^( Pl 



47re OJ R ;, \ r '" P2 R 2 

where 

Pi =-e(x la i+y la i+z ia k) 

and similarly for p 2 . The perturbation represents the mutual interaction 
energy of the two atomic dipoles at any instant. 

Since A is an odd function the first-order perturbation integral 
vanishes. The second-order energy change of the ground state is 



w-l-a 



l> 1 



E?-E? 



where h u is the matrix element between the ground state and the /th 
excited state. Since E? >E^ the energy change is negative and represents 
an attraction between the atoms. The form of the perturbation indicates 
that^a-l/tf 6 . 
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Eisenchitz and London (Z. f. Phys. 60, 491 , 1930) carried out this 
calculation and obtained the result 

M 

(4ne ) 2 R 6 



tf'-m 



The second-order Stark effect 

Consider the effect of an electric field S, acting along the z-axis, on the 
ground state of the hydrogen atom. The unperturbed Hamiltonian is 



ft 2 e 2 

^o = * V 2_. 



2/x Aite r 

where pt is the reduced electron mass. The perturbation 4 is the extra 
energy of the system due to the electric field and is equal to -p . & 
where p is the dipole moment of the atom, i.e. 

4 = e Sz. 

(The electron change is - e). 

The unperturbed ground state wave-function is (4.73), 



"U«J 



1/2 

-i-K 



^ 00 = [^J C 

As this function is spherically symmetric then clearly the first-order 
perturbation to the ground state energy is zero. 

However there is a second-order change in the energy. From (7.59) 

this is 

q = 2 

where z lq = f'PiooZ'I'q dr ?ind ^« is ** unperturbed excited hydrogen 
wave-function. From the form of the hydrogen atom wave-functions 
(4.72), it is clear that z lq is non-zero only for those excited states for 
which the quantum number m = 0. By further symmetry arguments it 
can also be shown that z lq is non-zero only when the excited state / 
quantum number is one. 

The change in the ground state energy is proportional to the square 
of the field strength and a polarizability a may be defined so that this 
change is -aS 2 l2. From (7.60) 

q = 2 H 
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The dominant term in (7.61 ) is clearly that obtained from the (2p z ) 
state when n = 2. A fairly short calculation then shows that 

a>2-9(ag47re ). 

ITie experimental value is a = 4-5 (flo4;re ). 



Perturbation of a degenerate level 

The perturbation theory developed up to this point has been concerned 
only with non-degenerate levels. However, degenerate levels occur very 
frequently and this work must be extended. This is not in fact very 
difficult. 

It is clear that the results already obtained do not apply for a degener- 
ate level as there would then be singularities in both the first-order expan- 
sion coefficients (7.58), and in the second-order change in energy (7.59). 
Examination suggests that these difficulties could possibly be removed 
if the perturbation interactions hy between degenerate levels, could be 
chosen to be zero. 

As a simple example consider the case of a two-fold degenerate level. 
In particular, in the previous treatment, (7.44-7.59), let E% = E$ . The 
first- and second-order corrections to these degenerate levels are given by 
the solutions of (compare (7.50).) 



\h u -\EP-\ 2 El 2 \\h n ,..., 

M21. Xh 22 -XE^-X 2 E{ 2 \ 

u 33 +eI-e?-\eP-x 2 eP, 

X/t„i 



^m 



= 0. 



(7.62) 



This determinant is again a polynomial in X. The lowest power is X 2 
and this term is the determinant 



\h u - XE\*\ Xh 12 , 



Xh 2l , Xh 22 -XE{ 1 \ 

0, e!- 



E°-E? 



= 
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i.e. 

(A„ - EPW>22 - *P>) - IA12 1 2 = 0- (7.63) 

The two roots of this quadratic give the first-order changes in the 
degenerate levels. Generally these roots are distinct and the perturbation 
removes the degeneracy. 

The degenerate basis functions can always be chosen so that the sub- 
matrix referring to the degenerate levels is diagonal. That is, in the 
present case, it is possible to obtain from Xi and X2. two new basis 
functions. 

xi = c n Xi +^12X2 

X2 =c 2 \Xi +C22X2 (7.64) 

so that the off -diagonal elements 

h'n = Jx?* Xi At = 0, h' 21 = Jxa^xi dr = 0. 

In this case the two solutions of (7.63) are 

£i (1) =Aii=JYWid'- and E[» =h' 22 = {x 2 **X 2 dr. 

The first-order effects of the perturbation has split the originally 
doubly-degenerate level E?, into the separate levels 

E?+h' n and £"? +A 22 - (7.65) 

The functions xi and X2 are the zero-order wave-functions in that the 
perturbed wave-functions must approach them as X ->■ 0. For the 
perturbed level ■fif + h' u , the first-order wave-function is 

* 1 = x'i + X [c2 1} X 2 + .2 c ( p X ] + .-.. (7.66) 

Keeping first -order terms only, equations (7.46) give (after 
diagonalizing the sub-matrix), 

h' n +(E?-E?)cP=0 / = 3,4,...,« 

where 

h'n =Jx*^Xi dr. 

There are no first -order terms in the first two equations. The second-order 
terms in the second equation give 

(*m - * ii)^ + 2 h' v cj m = where h' y = Jxa^X/ dT. 
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So the first-order coefficients for the wave-function with perturbed 
energy if? +h' u are 



ci=l 



40 =. 



c?> 



h'n 



-z 

/ = 3 

7>2. 



(h' 22 -h' n )(E?-E?y 



-W^eT)""- (7 - 67) 

It is left to the reader to find the first-order wave-function correspond- 
ing to the level Ef +h' 22 . 

This procedure can obviously be generalized to a degenerate level of 
any order. The problem is really to find the correct 'zero-order' 
(unperturbed) wave-functions that diagonalize the appropriate sub- 
matrix. With this choice of basis functions the first-order change in the 
energy is given by the same formula as in the non-degenerate case (7.52). 

The second-order changes in the energy can also be calculated. In the 
doubly-degenerate example discussed above the expansion of the 
determinant (7.62), keeping only those terms in X 3 gives (with^f 1 * = h' n 
and h'12 =h' 2 i =0) 

-\ 3 E{ 2 \h' 22 - EP)(E$ - 2?°) . . . (£•» - E?) 

-^\h' 13 \ 2 (h' 22 - E<t»)(E$ - E?) . . . (£» - E°) . . . 

-* 3 \h[ n \\h' 22 - E{»)(E° - E?) . . . (£*_, - E?) = 



i.e. 



A 2 £l 2 > 


XAia 





X(/*22-4 1} ) o 


M' 31 


(FS-Ei 



M', 



In 



\h' nl 







(e2-e?) 



= 0. 



(7.68) 



(This is of course the determinant (7.62) with the correct zero-order 
basis functions.) 

The second-order change in the level E° +h' n is 



*f>-2 



\h' u 



El 



o • 



The second-order change in the level E° + h 22 is 

n i,i |2 



(7.69) 



(7.70) 



m =3 
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These results can Be extended to a degenerate level of any order. 
When the correct set of basis functions is used the non-degenerate formula 
(7.59) applies except that the sum excludes all unperturbed levels 
degenerate with the level considered. In this way the singularities are 
avoided. 



The first-order Zeeman effect 

It is well known that degenerate atomic levels may be split when a 
magnetic field is applied. This is the Zeeman effect, and the theory for 
the hydrogen atom is outlined below. 

The classical Hamiltonian for a charged particle in an electromagnetic 
field was derived in Chapter 1 (1 .45) and it was shown in Chapter 3 
(question two) that to obtain a Hermitian operator on quantization it 
should be written (for an electron of charge -e and mass m) 

^ = ^- + ^-(P-A + A.p)+f-^ 2 -eV (7.71) 

where the magnetic induction and electric vectors are given by 

3A 
B = curlA and <¥=-gradV-— . (7.72) 

ot 

The quantum mechanical Hamiltonian is 

h 2 e e e 2 A 2 

Jf=- — V 2 --/ftA.grad-— «divA+— — -eV. 
1\i m 2n 2ju 

(7.73) 

For a hydrogen atom in a uniform induction field of magnitude B 
parallel to the z-axis the vector and scalar potentials may be chosen so 
that 



A x =--By A y =-Bx A z = V = 7I7Z . (7.74) 



\By A y =\... ... _ . ^^ 

As div A = and 



*-—H-'s~£K*£ ( " S) 

(where is the polar angle) the quantum Hamiltonian is 



jtr^-^-V 2 --^--m-§^- + fB 2 (x 2 +y 2 ). (7.76) 



2/x 4ire r ju 2 d<f> 8m 

For the fields normally met in the laboratory the term in B 2 is much 
smaller than the electric field term —e 2 /47te r and can be ignored. The 
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unperturbed Hamiltonian is that of the free hydrogen atom and the 
perturbation term is linear in B. 

*** **--A *=-**■—■ (7 - 77) 

2ju 4ire r M 2 b<j> 

The eigenfunctions of the unperturbed Hamiltonian are 

R n ,(r)Pr '(cos 0) e ,m '*. (4.72) 

The quantum number n determines the energy and there are degenerate 
eigenfunctions for 

/ = 0,1 (n-1), m, = -/,-(f-l),...,0,..., + /. 

The functions (4.72) are already eigenfunctions of 4 with eigenvalues 
ehBmi/2(x and so the perturbation matrix is diagonal. When the magnetic 
field is applied the degenerate levels with the same quantum number / 
and n are split into the (2/ + 1) levels with energies 

En+^-m, m, = -/,...,+/. (7.78) 

These energy levels explain the spectrum of the normal Zeeman effect. 
However there are additional effects due to the spin magnetic moment of 
the electron and these will be discussed in Chapter 10. 



7.4 The W.K.B J. Approximation 

This method carries the names of G. Wentzel, H. A. Kramers, L. Brillouin, 
and H. Jeffreys and it can be used to deduce the Bohr-Sommerfeld 
quantum conditions from quantum mechanics. The W.K.B.J. approxi- 
mation is applicable to ordinary differential equations. That is, one- 
dimensional eigenvalue problems or three-dimensional problems that can 
be reduced to one-dimensional form by separation of variables. It is 
useful when the potential is a slowly varying function of position in the 
sense that the fractional change of potential is small over several de 
Broglie wave-lengths. This will arise in the case of large quantum numbers 
when the wave-function has many nodes and the distance between 
them is small, corresponding to a short de Broglie wave-length. This 
method has been termed 'quasi-classical' in that it is most appropriate in 
the almost classical limit of large quantum numbers. Consider the one- 
dimensional, time-independent Schrodinger equation, 

^ 2+ f ( E-V( X ))*=0. (7.79) 

In Chapter 4 it was shown that the classical limiting form of the wave- 
function is essentially exp (iS/h) (4.9) where S is Hamilton's characteristic 
function. 
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Look now for solutions of (7.79) of the form 

iK*) = exp(i4/ft) (7.80) 

where A (x) is some function of position. For \jj to satisfy (7 .79) .4 must 
satisfy the non-linear equation 



(#- 



^-2m(E-V) = 0. (7.81) 



Assume that it is now permissible to expand A (x) in series in h. 

h 2 
A(x)=A (x) + hA 1 (x) + Y A 2(x) + • • •• Om 

The classical limit corresponds to ti -*■ and as expected A (x) is related 
to Hamilton's function S(x). Quantum effects are included in the higher 
terms. If (7.82) is substituted into (7.81) and terms in and higher powers 
ignored then, remember ft is small 

(dAn\ 2 

1 — °* =2m(E~V), (7.83a) 



(£)' 



i.e. 



A = ± j {2m (E - V)} in Ax. (7 .83b) 

This is the expected result since the classical momentum 

p(x) = {2m(E- V)} 112 
and Hamilton's function 

S(x, t)<=-Et±jpdx. 
A wave-number k(x) may be introduced with 

2m(E-V)=p 2 (x) =h 2 k 2 (7 .84) 

and then 

A (x) = ±hj k(x) dx. (7 .85) 

If the first two terms of (7.82) are now substituted into (7.81) and 
the terms up to and including first order in H retained 



dA 
dx 



tfH^X-)^-'-*-' - - 
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From (7.83) this becomes 
lo 



d 2 A, 



dA t _ 
dx 

In terms of k. 



dx 2 

&A 

dx 



A 1 (x) = -\nk(x). 



(7.86) 



The approximate solution of Schrodinger's equation (7.79) obtained 
by using the first two terms in the expansion (7.82) is 

*(*) = TTn [ K i ex p(' j k dx ) + K 2 exp (-/ jk d*)] (7.87) 

where K t and K z are constants chosen to satisfy the boundary conditions. 
If E > V, k is real but in the classically forbidden regions where V>E,k 
is imaginary. 

The validity of the approximation requires (in (7.81)) 



d 2 A 
dx 2 



® 



(7.88) 




If this is evaluated using A , and writing X(x) = 2n/k(x) 



11. 



That is, the approximation will be good if the fractional change in 
(E - V) over the de Broglie wave-length X is small. Clearly the approxi- 
mation breaks down near those points where E = V(x). Classically these 
are the turning points of the motion where the kinetic energy and hence 
the momentum is instantaneously zero and the particle changes 
direction. 

At these turning points k = corresponding to an infinite de Broglie 
wave-length. Then the approximate form (7.87) is singular although the 
exact wave-function will not be. In fact the expansion for A(x) (7.82) is 
not a convergent series but is asymptotically correct and can only be 
used several wave-lengths away from a turning point. 

Turning points generally occur in problems and a method must be 
developed to deal with them. Suppose there is a single turning point at 
x = a and 

E<V(x) x<a, 

E>V(x) x>a. 
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Figure 7.3. 

In some region about V say,af 1 < x <jc 2 the approximate wave-function 
(7.87) is not valid. 

Outside this region (7.87) may be used with k given as below. 

(a) For x > x 2 



k = Ql^l (E _ v yn >0 
fl 



(7.89) 



(b) Forjc<#, 

k = iy 
and 

7 = ^f-V-iO 1/2 >0 (7.90) 

The physically acceptable solution must vanish as x -*■ — °° and is 

a 



<K*)~y/2 ex P 



-J 7 dx 



x<x 



(7.91) 



Observe that the exponent decreases as x moves away from 'a'. The 
problem is to find the corresponding solution of the form (7.87) valid 
for x ~>X2- 

As V(x) is analytic it can be expanded in Taylor series and in the 
linear approximation it is represented by two terms 



V{x)=E-B(x-a) B>0. 



(7.92) 



In the region x t < x < x 2 where (7.87) does not apply, Schrodinger's 
equation is solved exactly using the potential (7.92), i.e. 



dV 2m , 
-^2+^TB(x-a)^=0 



Xi <x <x 2 . 
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The independent variable may be changed to 



y= -\ir) {x ' a) 



and then 

d^> 
dy 



TT-^ =0 - 



(7.93) 



In terms of this variable the exponent in (7.91) can be written (from 
(7.90), (7.92)) 



\ydx=\y™ 



and similarly (7.89), (7.92) 

X 

\kdx=hy\™. 



(7.94) 



(7.95) 



Equation (7.93) is Airey's differential equation. One of the two solutions 
is finite for all y and the asymptotic form of this for large \y | is 

Hy)~f p74ex P (-f;> 3/2 ) y>o (x<a) 

D i^"(i" ,M -T) 



Hy) 



y<0 (x>a). 

(7.96) 



(The second solution is also oscillatory for y < but increases 
exponentially for y > 0.) 

From (7.94), (7.95) and (7.96) it is clear that the solution of the 
form (7.87) that is valid for x >x 2 and connects with (7.91) is 



lK*)~ p72" cos 

This can be written 
1 



J* 



dx 



„l/2 



exp 



-j Y dx 



A/2 



COS 



x>x 2 - 



k dx 



(7.97) 



The direction of the connection arrow is important. For further details 
see Kembles's Quantum Mechanics. 
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Figure 7.4 

The Bohr-Sommerfeld conditions 

The problem here is to find the energy levels for a particle moving in a 
one-dimensional potential well. 

Consider the diagram. For any energy E, there are two classical 
turning points, say x = a and x=b. 

The asympotatic W.K.BJ. solution valid outside the region a < x < b 
must converge as | x \ -*■ <*> and so (from (7 .87)) 



*(*)' 



*(*)' 



1 
372 ex P 



1 

172 exp 



-J 7 dx 

X 
X 



x<a 



x>b. 



The connection formula at x = a is (from (7.97)) 



1 
172 ex P 



u 

-j 7 dx 



■If' 



Jin cosUkdx-1 



Similarly the connection formula at x = b is 



An 



cos 



kdx 



1 



.,1/2 



exp 



-Jrd* 



The l.h.s of (7.99) can be written 



A/2 



cos 



ffcdx- — -a 
i 4 



where 



(7.98) 



(7.99) 



(7.100) 



.-J 



ifcdjc 
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For the two connection formula to be consistent the r.h.s. of (7.98) 
must be the same function as the 1Ji.s. of (7.99). Since a sign change is 
immaterial then a must be zero or a positive integral multiple of it, i.e. 

b 

"fcdjc-y =mt, n =0,1,2,.... (7.101) 



u 

V 



The integral over the period of the classical motion is 
b 
jkdx=2 J kdx = 2n(n+i). 

a 

Writing p =Hk this condition becomes 

jpdx = (n+i)h, «=0,1,2 (7.102) 

The equation (7.101) (or (7.102)) determines the eigenvalues of the 
problem. The last equation is the Bohr-Sommerfeld quantization rule 
with half-integer quantum numbers. 



PROBLEMS 

1 Suppose \jj is some normalized approximate wave-function for the 
ground state of a system. Define the integrals 

/ = JV(.3f-£i) 2 ^ dr and K n = JV^f"^ dr, 

where ^f is the Hamiltonian operator. By expanding $ in terms of the 
exact eigenfunctions of Jt? show that 

I>(E t -Ktf 

where E x is the true ground state energy, if K l is closer to E\ than to the 
energy of any higher state. 

Express / in terms of K\ and K 2 and show that 

E 1 >K 1 -{K 3 -K}) in . 

This formula, due to Weinstein, provides a lower bound to the ground 
state energy. 

2 Equation (4.26) gives the Hamiltonian for a linear harmonic 
oscillator. Using the normalized function 



n (a z +jc z ) 

obtain the expectation value of the Hamiltonian. Use the variational 
principle to show that the best choice for a for the ground state is 
l* 2 /2mky'* and that the best value for W > isff(fc/2m) 1/2 . 
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3 Consider the one-dimensional particle whose Hamiltonian is 

3*=-— — -J +kx*. 
2m dx 

Using the non-normalized trial function 

Wx) = e- a * x2 ' 2 

show that the expectation value for the energy is 

Am 4a 

Using the variational principle show that the best value for a is 
(6kmfK 2 ) 1/6 and obtain the corresponding value for the ground state 
energy. 

You may assume the integral 

f e- a2x2 dx=y/i/a. 



4 Suppose a system of classical particles is in motion. Newton's second 
law states that xi = mfii where Xi is the force in the ^-direction acting 
on the ith particle of mass m,. Confirm the expression 

x t m t . , 1 d , 

jX i -- T x i+ --{m i x i x i ). 

Show that provided the particle is confined to a finite region of space, 
the time average as t -*■ °° of the last term on the right-hand side is zero. 
For conservative systems the forces acting on the particles may be derived 
from a scalar potential. 

3 
Xi = -7^-V(x u y u z u ...). 

If V is a homogeneous function of the co-ordinates of degree n show 
that 

2 

where Kand Tare the time averages of the potential and kinetic energies 
of the system as t -*■ «>. 

(You may use Euler's theorem 

2 \Xi-r— + yfr— + *| — \ = nV.\ 
t \ *Xi 7t by, 'dz t J ) 

This is the virial theorem of classical mechanics. 
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5 Let rj and r 2 be the position vectors of two points in space relative 
to some origin. A well-known result is that 



In 






n >r 2 



(cos 0) r 2 >/"i 



where is the angle between r! and r 2 and P„ is the Legendre polynomia 
of degree n. Another result is that 

p„ (cos 0) = p„ (cos i)p„ (cos 2 ) 

+ 2 (-l) m 2^^ir(cose 1 )i , n m (cos0 2 ) 

m = ! (« + OT)! 

x cos m (0i — 2 ) 

where (0 j , t ) and (02, # 2 ) are the polar angles corresponding to rj and 
r 2 respectively. 

Hence show that if 



iKri 



■--m 



exp 



- — (n +r 2 ) 



then 



JV 



1 



5 e 2 
i|/dT,dr 2 =-a- 



47re y In -r 2 l ' " 8 a 4ne 

This is the result (7.24). You may assume that 



+i 

j>„Gx)dn = «>0 



and that 

^00= I- 

6 Consider the eigenvalue equation (7- 28 )- Substitute for i// using 
(7.29). Premultiply the equation by xi and integrate and confirm that 
(7.35) is obtained. 

7 Confirm that the perturbed wave functions (7.58) obtained by 
choosing c/ = 1 are orthonormal to first order. 
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8 Suppose the Hamiltonian tf is a function of the parameter \ 
(e.g. (7.45)). Let i//„ be the normalized eigenfunction with eigenvalue 



(that 



Show that 
d£„ 

dX J " " 9X 
This is the Feynman theorem. Note that \jj„ will depend upon X and that 

— (V**„dT = 0. 

In the case of perturbation theory dJ^/dX =4 (7.45). By expanding 
both sides of (i) in powers of X show that this equation is consistent with 
the perturbation theory results (7.54), (7.58), (7.59). 

9 The Hamiltonian operator describing the Helium atom is given by 
(7.17). This may be regarded as an unperturbed system with Hamiltonian 

1m 2J 47re \r 1 r 2 

and a perturbation 

„2 



47re |ri -r 2 l ' 

Show that to first order, the energy of the ground state of the 
perturbed system is 

„2 



E. =■ 



(t)" 



-1 47re o a 
(Compare this with (7.27).) 

10 A perturbation ax 4 is applied to a one-dimensional simple harmonic 
oscillator (4.26). Show that the first-order change in the energy is 

|(2w 2 +2n + l)h 2 a/m 2 aj 2 . 

You may use the recurrence relation 

2yH„{y) = 2«tf„_ 1 (>>) +H n+l (y) 

where H„(y) is the Hermite polynomial of degree n, and also the 
orthogonality relation 



j e-y 2 H n (y)H m (y) dy = 2"n! v^5, 
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1 1 The Schrodinger equation for a plane rigid rotator with a uniform 
electric field S applied in the plane is 

h 2 d 2 ^ 
~2I d* 2 - ^ + M ^ cos *)^=0 

where / is the moment of inertia, ju is the electric dipole moment and <j> 
is the angle of rotation. Regarding the term -/u ^cos as a perturbation, 
show that the energy, correct to second order is 



2/ 



m 2 + 



Iix 2 S 2 



h*(4m 2 - 1) 



m = 0,±l,±2,.., 



Find an expression for the polarizability and deduce that the dipole is 
parallel to the field when m = 0. Comment on the results for the states 
m = ± 1 . (Hint: remember this is strictly a degenerate problem.) 

1 2 Show that the eigenfunctions for a free-state electron moving along 
the x-axis, and satisfying the periodic boundary conditions 



ip(x+L) = ^(x) 
are 






Hx) = —= exp 


~.,2n ' 

il — X 

L 


I integer 


with energies 


h 2 l 2 

£°(0- • 
2mL 


Suppose a periodic potential 


V(x)=V(x+a) 


with 


L=Na N integer 


is introduced as a perturbation. If the mean value is zero 


V(x)= 2 V„e 

n *0 


- in T x 


with V* = -V_ 



Show that to second-order terms, the perturbed energy is 



i?(/)=£°(/) + 2 -5 



n#0 



E v (l)-E°(l-nNY 



Explain why this result is not valid if 7 = nNjl. 
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In this case show, that to first order 



'(tMtH 



1 3 The 'phase' of the wave-function (7.98) is 

X 

tf>= j"fcdx-7r/4. 

a 

Explain why the quantization condition (7.101) implies that the wave- 
function has n and only n zeros, (nodes). 

By considering the quantization rule (7.102) for adjacent levels of 
laTge n and using the classical relation 

dp 

where V is the velocity and E the energy, show that the difference 
between the neighbouring levels is 

bE=Hu> 

where co = 2n /Tand 7" is the 'period' of the motion. 
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CHAPTER 8 



Time-Dependent 
Perturbation Theory 



8. 1 Variation of Constants 

The time-independent perturbation theory was discussed in the previous 
chapter. Then the object was to find the changes in the eigenvalues and 
eigenfunctions of the stationary states of a system when a constant 
perturbation was applied. Suppose now that the perturbation is a function 
of the time. In this case the perturbed Hamiltonian operator is time- 
dependent and stationary states do not exist. The problem is now to find 
the time-dependent state function. 

Consider a system where the Hamiltonian can be written 

Jf =JJf°+X/£0) 0<X<1 (8.1) 

where 3f° is independent of time and A(t) is a function of time. It is 
assumed that the energy associated with 4(f) is small compared with 
that associated with 3tf ° . As before the parametric value X = 1 corre- 
sponds to 'complete' perturbation and X = to zero perturbation. 

JV must be chosen so that its eigenfunctions are known. Let the 
orthonormal set {x&} be these time-independent eigenfunctions. 

*°X*=.E*X*. (8.2) 

3^° is assumed to possess a discrete spectrum. 

The general solution of the unperturbed time-dependent wave 
equation 

jf0^o = _^^_ (83) 

i bt 
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can be expressed as a linear combination of the variable separable 
solutions (3.15), i.e. 

oe 

*°= 2a k exp[-tE$t/h] Xk . (8.4) 

fc = i 

The flfc's are constants. 

The perturbed time-dependent wave equation is 

h a* 
{je°+\Ayir = -~. (8.5) 

I at 

At any instant, ^ can be expanded as in (8.4) but the values of the 
expansion coefficients will no longer be constants in time. 

* = 2«*(0 exp [-iESt/h] X *. (8.6) 

k 

Substitute (8.6) into (8.5) and with the help of (8.2) it is easily shown 
that 

X 2a k A e-^'\ k = - * S^f- e-*i<l\ k . 
k i £? dt 

k 

Multiply this last equation by e'%/" x * and integrate over the space 
co-ordinates to obtain 

X 2a k h„S<*-'*>«* = --^ /- 1.2. . . . (8.7) 

k i at 

where 

hik(t)=\x*Ax k dT. 

The equations (8.7) are the basic equations of the method of variation 
of constants and determine the expansion coefficients. However to solve 
these equations it is necessary to use perturbation theory. As in the 
previous chapter (7.49) it is assumed that the wave-function is a con- 
tinuous function of X so that it is possible to expand a k as a power series 
inX 

a k =a k '+\a k 1) + .... (8.8) 

Equation (8.8) may be substituted into (8.7) and the resulting 
equation regarded as a power series in X. The zero-order term gives 

'^dT' /=1 > 2 >--- ( 8 - 9 > 
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showing that the zero-order expansion coefficients are constants. The 
first-order term gives 

ih^-- ZaJPhttM-*'**!* l=l,2,.... (8.10) 

dr k 

The general power X" yields the higher order perturbation coefficients 

ih ^ = 2 ai"- l \ k e'rt -**></» / = 1, 2, . . .. (8.1 1) 
dr k 

The zero-order coefficients, (8.9) do not change with time and their 
values are specified by the initial state of the system. Suppose the system 
is in the rth stationary state of the unperturbed Hamiltonian when the 
perturbation is introduced at t = 0. Then 

a/°> = l af = f*i. (8.12) 

The first-order changes in the expansion coefficients are then obtained 
by solving the equations 

ih ^=h fi e '(*> -*?>'/» /= 1, 2, . . .. (8.13) 

At 

The solutions satisfying the initial conditions a} 1 ' = are 

t 

af\t) = - £ j" WO e i < E >- E °>»' h dt /= 1, 2 (8.14) 

o 

In the simple case that the perturbation does not depend explicitly on 
the time but is switched on at time t = and switched off at r = t', the 
solutions of (8.14) are 



-° r° \Ja 



apV) = h ft e0 _ e o f* i 

aP(t') = -i^t'. (8.15) 

These values remain unchanged for t > t after the perturbation is 
switched off and the wave-function which for t < was X/ exp [—iEptfh] 
is 

*= [1 +«JV)]«- ar?,/ *X/+ 2 aW)*-"^/ *>*' 

where the coefficients are given by (8.1 5). If the set {%/} is orthonormal 
then this wave-function is normalized to first order. 
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The product tf} 1) *(f)ap ) (Y) gives the probability of finding the system 
in the state x/ exp (-ikft/h) after a time t. 

la f (r)l -'*/»' — i e ?-e?\ 2 o<t<t'. (816) 
\ 2h 

The system may be considered to make transitions into all possible 
states. The expression (8.16) is periodic in time and the time At of a 
transition is of the order of h/(Ef - E?). Strictly Ef is not an eigen- 
value of the perturbed Hamiltonian but if the perturbation is small 
enough it may be taken as a system eigenvalue. If the energy change 
(Ef - E?) is significantly different from zero the time spent in the upper 
state is small. This is to be expected as in such transitions energy is not 
conserved and the change in energy AE = (Ef - E?) and the time 
available for measurement must be consistent with the uncertainty 
principle, 

(AE)(At)~h. (8.17) 

Energy is conserved within the limits set by the uncertainty principle. 

At time t > t' after the perturbation is switched off the probability 
of finding the system in the exact eigenstate x/ exp (-iE ft/ft) is simply 
(8.16) with t replaced by f '. If the matrix element hf t is zero the tran- 
sition from the ith state to the/th state cannot take place by a 'first- 
order' process. However, if A// and hfi are both non-zero then this 
transition may take place by a 'second-order' process through the 
'virtual' state/. The details of this depend upon the second-order change 
in the wave-function and will not be dealt with here. 

The expression for \a f l) (t')\ 2 may be written 

, (1 )„N|2 _ IM 2 sfrV) .,. (Ef-E?) 

14^)1 ~p p- «* *~ aT~- 

The function sin 2 (xt')jx 2 is sharply peaked about x = with zeros at 

xt' = ±nn, n = 1 , 2 This is another expression of the result that 

energy is conserved in transitions, the most probable transition being 
those for which Ef ~E?. If the perturbation is applied for a very small 
time t' then \a.f^(t')\ 2 is proportional to r' 2 and the transition prob- 
ability per unit time is not a constant. This undesirable result is avoided 
if it is assumed that there is a continuous distribution of states. 

Suppose it is desired to find the probability that a system originally 
in the fth state will finally be found in the continuous range 

5/ 8/ 

/-y</</ + ^- 
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The probability of finding the system in one of these states after the 
perturbation is switched off is 



J" 



14V)! 2 d/ (8.18) 

8/ 

/is a continuous variable (eigenvalue) that defines the state uniquely. 
This can be written 

\p{E?)\a?\t')? lE f (8.19) 

BE 

where p(E°) is defined so that p(E° ) AE f is the number of states in the 
energy range Ef + d£y and the range of integration bE corresponds to 8/ 

For f ' sufficiently large (so that 5E > fr/t') the transition probability 
has the properties of a delta function (see equation (3.36)) and (8.19) 
may be written 

^ j I h ri \ 2 p(E?)t>(E? - E?) d£> = j t' \h fi \ 2 p(E?). (8.20) 

SE 

This result assumes that the energy range 8E includes E? and that p(E$) 
is a slowly-varying function. This probability is linear in time and the 
transition probability per unit time P, that the system undergoes a 
transition from the initial state i to one of the final states is 

P = fp(E?)\h fi \ 2 (8-21) 

P is a constant as expected and equation (8.21) is Fermi's golden rule. 

8.2 Semi-Classical Treatment of Radiation 

In this section the interaction between a material particle and electro- 
magnetic radiation will be discussed. The particle will be assumed to 
obey quantum mechanics and the radiation to satisfy the classical Maxwell 
equations. Clearly such a semi-classical hybrid of quantum and classical 
mechanics cannot be completely satisfactory and the radiation field should 
be quantized and described in terms of photons. Such a treatment is 
beyond the scope of this book and the interested reader is referred to 
Introductory Quantum Electrodynamics, by E. A. Power. 

Classically, an electromagnetic field can be described by the scalar and 
vector potentials <j> and A with 

9A 
B = curlA and ^=-grad0-— (8.22) 

ot 

where B and E are the magnetic induction and electric field vectors 
respectively. The vector field A can be resolved into an irrotational and 
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a solenoidal part. The first of equations (8.22) defines the solenoidal 
part only and in free space the vector A can always be chosen to satisfy 

1 deb 
divA+^--^=0 (8.23) 

c 2 at v ' 

c is the velocity of light. 

It has already been shown (7.73) that the Schrodinger equation for an 
electron of charge — e and mass m in an electromagnetic field is 



h 2 e e e 2 A 2 

— — V 2 — — /ftA.grad — fh div A + — — - e<!> 

2m m 2m 2m 



V 



3* 
"»~ (8.24) 

More than one set of scalar and vector potentials describe the same 
electromagnetic field. If x is a solution of the wave equation 

V 2 X-^y=0 (S-.25) 

then new potentials 

A' = A-gradx <t>' = <t> + j; (8.26) 

at 

can be defined which still satisfy (8.22) and (8.23). Such a transformation 
is called a gauge transformation. In regions where there is no free charge 
the scalar potential must also be a solution of the wave equation (8.25) 
and in this case x can always be chosen so that the scalar potential is zero 
and from (8.23) div A = 0. 

With this choice for A and <f> and if the field is weak so that the term 
in^4 2 can be neglected, the SchrQdinger Hamiltonian becomes 

h 2 e 

-—V 2 -— /ftA.grad. 
2m m 

If the electron is bound to an atom it is necessary to introduce an 
additional potential energy term V to represent the central field and it is 
then appropriate to use the reduced mass jx for the electron. The 
Hamiltonian is then 

fi 2 e 

jr=- — V 2 + V--iHA.gmd. (8.27) 

2fi n 

This Hamiltonian can be written 
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where tf° is the Hamiltonian for an electron in a central field and 4 is 
the perturbation operator 



e _ e 

A = — — ih\. grad = + — A . p 



(8.28) 



where p is the operator representing the linear momentum of the electron 
For a monochromatic plane wave, the real vector potential is 

A = (A e /a )e i(k - r - u;f) +(Aoe-'" a; )e-'' (k - r - w ' ) (8.29) 

where A is a constant real vector and co is the angular frequency. To 
satisfy div A = it is necessary for 

A .k = 0. 

That is, the plane of polarization of the wave is perpendicular to the 
direction of propagation. 

The expansion coefficients can be found using the result (8.14). If the 
perturbation is switched on at t = when the system is in the unperturbed 
atomic state i then to first order 



a ( /\t) = - 



(U fi - u) fl (W/,- + <o) 



(8.30) 



where 



*>- 



Ja 



fltl 

e 



1 M 



f** 



ik.it 



'0-PX/dr 



-ik . r 



Ao-PX/dT 



and 



If (co/i ± w) is not close to zero then aft\t) oscillates very rapidly and 
the time spent in the state / is very small. Following the argument from 
(8.14) to (8.16) the probability of finding the system in the state / at a 
time t is 

+ (product terms) . (8.31) 

In all cases the product terms in the expansion for \a^(t)\ 2 are small 
and can be neglected. 
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The probability of finding the system in the state / at the time f is 
appreciable only if the frequency of the incident radiation is such that 

cjfi — co — or alp + co — 0. 

These are the cases of resonance. At most one of these conditions is 
satisfied for a given frequency and two given energy levels. Transitions 
between the states i and /are probable only if 

E?-E?=±±hu. (8.32) 

The positive sign applies when the particle absorbs the energy too from 
the field. If (8.32) is satisfied with the negative sign the particle loses 
energy ftco to the field. 

In atomic spectra the initial and final states are discrete and the 
transition probability per unit time would appear to be time-dependent 
(c.f. (8.16)). The difficulty is overcome by remembering that for a light 
source the radiation is produced by the random vibration of atoms that 
are widely-spaced. In practice the radiation is not strictly monochromatic 
but covers a small but finite range of frequencies 

5co _ _ 5co 
to-— -<co<co +— , 

2 2 

the frequencies having random relative phases. In this case interference 
effects can be neglected and the contributions from the different 
frequencies are additive. 

Consider the case of absorption where energy is absorbed from the 
field. In this case co/i — co and only the first term in (8.31) need be 
considered. The total probability is obtained by integrating this term 
over the frequency range 5co. The delta function properties of (8.31) 
give the probability for absorption (upward transition) to be for t 
sufficiently large, 



J" \a ( f\t)\ 2 dco = 27rf J \ti fl l 2 5 (co /f - - co) dco. 



Su 8u 

The integral is independent of time and so the transition probability per 
unit time is a constant. 



P=2tc j" \h' f( \ 2 8(o> fi -oj)du). 



6cj 



The magnitude of the vector potential A is a function of the 
frequency and this integral can be written 



P=-ftfA(o>fd\ \ Xf e*-*Po* dr| (8.33) 
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where p represents the momentum component along the direction A . 
The use of the delta function is equivalent to assuming that Aq(oS) is a 
slowly varying function of to. 

The transition probability depends upon the intensity of the radiation 
at the 'resonance' frequency. At any time instant the rate of flow of 
energy per unit area is given by the Poynting vector 

N = «?AH. (8.34) 

For a plane wave in free space 

Mo 

where E is the magnitude of the electric field. The value of |N| associated 
with the frequency w and averaged over one oscillation of period 27r/co 
is (8.22), (8.29), (8.34) 

— Al{w)=E(u) 
Moc 

where E(<J) is intensity of radiation at frequency to and /u is the 
permeability of free space. From (8.33) 

P= fjt^T *(«/,)|{x;e*- , PoW dr| 2 (8.35) 

where E? =* E? + ftto. 

Consider now the case of emission where the energies are such that 
CO/; — -co, i.e. 

Ef^E? -ft to. 

The second term in (8.30) is now dominant and the probability for the 
downward transition from /" to / is simply (8 .35) with e ,-k • r replaced by 
e -ik.r jjjjj i s t h e r esult for emission induced by the presence of a field. 
Note that not only is the absorption probability proportional to the field 
intensity as expected but that the stimulated emission probability is also 
proportional to the intensity. Also the probabilities for induced upward 
or downward transitions between a given pair of states in a given field 
are equal and proportional to the integral 

|J^e' k - r PoX/dr| 2 . (8.36) 

This is an example of the principle of detailed balancing. In an upward 
transition a photon (a quantum of radiation energy) is absorbed from the 
field and in a downward transition a photon is emitted by the particle. 
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\he electric-dipole approximation 

Tie exponential in (8.35) can be expanded 

e 'k-'=l +/ k.r- i -^- +.... (8.37) 

2 

[he wave-length of visible radiation is much greater than atomic dimensions 
bid in the vicinity of the atomic system where the wave-functions X/ and 
it are appreciable, k . r is much smaller than one. 

In the dipole approximation (8.35) is evaluated with e' k • r replaced by 
unity and the induced probability per unit time is proportional to the 
square of the integral, 



jx*PoXi dr. 



This approximation will not be valid for X-rays. Following an argument 
similar to that used in Chapter 5 (5 V 70) it can be shown that 



jXfPoXi dr = yt — J x* r xt dr 



where r is the component of the electron displacement r, relative to the 
origin, parallel to the direction of polarization. (— et) is the electric 
dipole moment of the electron. From the generalization of (5.47) 
(question 4, Chapter 5) 



d f * 
d7j X ' 



x}r X, &T= l - (£>° - E?) J x* f r Xi dr. 



n 

Finally, the integral in (8.35) becomes 
jXfPoXi dr = ifiWfl J x* r x t dr. 



The transition probabilities per unit time for absorption and induced 
emission are now 



■^(w//)! jx*>oXidr| . 



a 2 

If the radiation field is isotropic this becomes 



T/ioce 



2 



3h 



2~ E(o) f i)\ \x*rXi dr 



2 



(8.38) 



where r = ix + \y + kz and E(cjf j)/3 is the intensity in the x, y, z 
directions. The coefficient of the intensity in this formula is the Einstein 
B-coefficient of absorption or induced emission. 
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If the dipole integral fx} r Xi dr is zero for a pair of states the tran- 
sition probability vanishes in this approximation and the transition is 
said to be forbidden. In fact the transition probability may not be 
exactly zero as there may be contributions from the other terms in the 
expansion (8.37). 

If the expression (8.35) vanishes the transition is said to be strictly 
forbidden. Even in this case, however, the transition may take place by i 
second-order transition. The reader is referred to Quantum Mechanics, rJ 
Schiff, p. 254, for a deeper treatment. 



Spontaneous emission 

Even in the absence of an external field there is a chance that an atom in 
an excited state with energy E t will make a transition to a lower energy 
state Ef with a corresponding emission of radiation energy. It can be 
shown (Schiff, Power) that the transition probability per unit time for 
such a spontaneous emission is again proportional to the square of the 
modulus of the dipole integral, i.e. 



|jx?rx,-dr| 2 . 



Selection rules 

The conditions which the functions x» and Xf must satisfy to prevent th< 
dipole integral (8.38) from vanishing are called 'selection rules'. Note 
that if both functions are spherically symmetric (atomic-states with / = 
the integral vanishes and the transition is forbidden. Similarly, using the 
hydrogen-type wave-functions it can be shown that the electron transitic 
is 'forbidden' unless the changes in the / and mi quantum numbers satisfy 

8/ = ±l 
and 

bm t = or ± 1 . 

This is dealt with in more detail in Chapter 1 1 . 

In Chapter 6, problem 14 it is shown that the matrix elements oix nm 
with respect to the harmonic oscillator eigenfunctions are zero unless 
n=m± 1 . This implies the selection rule 

5n=±l 

so that the spectrum should consist of a single frequency equal to that o 
the oscillator. This has already been mentioned in Chapter 4 with referer 
to the vibrations of diatomic molecules. 
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Line breadth 

The uncertainty principle can be expressed in the form 

(At)(AE)>n/2 

where (At) is the uncertainty in the time measurement and (AE) is the 
uncertainty in the energy. 

The probability per unit time X for a transition from a state i to a 
(lower state/ of a system is independent of the time. Such a time- 
independent process obeys an exponential law of decay and 1/X is of the 
order of magnitude that the system stays in the upper-state. Clearly a 
determination of the energy of this upper state cannot take longer than 
the lifetime 1 /X of the state and so the uncertainty in the energy is 

(AE)>fi\l2. 

This uncertainty in the energy corresponds to a broadening of the 
emitted spectrum line (uncertainty in the angular frequency of X/2) and 
in general, the longer the lifetime of the state the sharper the spectrum 
line. This broadening imposed by the uncertainty principle is very small 
and is of the order of that observed although there are other factors such 
as radiation damping which affect the observed line width. 



8.3 Elastic Collisions 



Scattering cross-section 

Suppose a parallel stream of non-interacting particles impinges upon a 
fixed target. Let the density of incident particles be such that n particles 




Incident Target 

t Incident 
beam 



Figure 8.1 
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cross a unit area perpendicular to the flux in unit time. The incident 
particles will be scattered due to the interaction with the target and the 
number of particles scattered into an element of solid angle 5 £2 will be 
proportional to n8 J2. The constant of proportionality a has the dimen- 
sions of area and is called the differential cross-section. It is not a 
constant but depends upon the polar co-ordinates (0, 4>) of 5 £2 relative 
to the incident flux. 

o(0, 0) is the probability per unit time for a particle to be scattered 
into unit solid angle about (6, 0) when a beam of unit flux bombards the 
target. The total cross-section is defined to be the integral over the solid 
angle 

2it n 



Ja(0, 0) dft = J ( o(0, 0) sin d0 d<j>. 



o o 



Quantum theory of scattering 

The result of a collision in classical mechanics is determined by 

(a) the initial particle velocities; 

(b) the distance at which the particles would pass if there was no 
interaction; 

(c) the interaction potential between the particles. 

Since the concept of particle trajectory loses its meaning in quantum 
mechanics a different approach is required. 

The quantum mechanical Hamiltonian representing a system of two 
particles whose interaction depends only on their relative displacement 
commutes with the operator representing the total linear momentum of 
the system. So the expectation value of the momentum in any state that 
satisfies the Schrodinger equation is a constant. linear momentum is 
conserved in collisions in both quantum and classical mechanics. 

In Chapter 4 it was shown that if the potential in a two-body problem 
depends only on the relative displacement of the particles the Schrodingei 
equation can be split up into two, one of which describes the motion of 
the centre of mass and the other describes the motion of a single particle 
with reduced mass u in the field due to a fixed centre of force. 

The scattering problem can be reduced to a single-body problem by 
choosing the centre of mass co-ordinate system in which the centre of 
mass is always at rest. Experimentally a more obvious system is one in 
which the target particle is initially at rest. This is the laboratory co- 
ordinate system. The differential cross-section is not the same in both 
systems. However the total cross-section is obviously the same in both 
systems as it refers to the total number of particles scattered in all 
directions. 

The centre of mass system is used below. 
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Consider a collision in which an incident particle is scattered by a 
target. If the 'internal energy' of the two particle is left unchanged, the 
collision is said to be elastic. In the centre of mass system, the problem 
is to solve the equation 

- ^ V 2 >Kr) + K(r)<Kr) =£->Kr) 

where E is the energy of the relative motion. The precise form of the 
scattering potential V(r) may be quite complicated and consequently 
most collision problems cannot be solved analytically. However it is not 
difficult to show that at large distances from the scatterer the wave- 
function will take the form 



ikr 



e'«*+f (0, 4>)—. 
r 

The first term represents a particle incident along the positive z-axis 
whereas the second term represents the scattered wave. The problem is 
to find f{6, <t>). 

Collision problems can be solved by time-dependent perturbation 
theory. In this treatment the short-range interaction is regarded as a weak 
perturbation which is zero initially and decreases to zero after the 
collision. Equation (8.21) will now be used to evaluate the transition 
probability when the initial and final states of the scattered particles are 
free-particle momentum eigenfunctions. 

The time-independent part of the wave-function of the incident 
particle is, when normalized to give unit incident flux 

x< = /Sr ex p( /k o- r ) ( 8 - 39 ) 

This represents particles with momentum hk . The wave-function of a 
scattered state is, using the 5 -function normalization for a continuous 
spectrum, 

Xfc = yr— ? exp (?k . r). (8.40) 

For elastic collisions the internal energy of the particle is left 
unchanged and 

k =k. 

The change of momentum of the incident particle is 

ft(k-k ) = -ftK. 
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The perturbation matrix element corresponding to the initial and final 
states (8.39) and (8 .40) is 



^• = y3^i F(r)exp(fK - r)dr 



(8.41) 



where the potential V(r) is due to the interaction between the incident 
particle and the scatterer. This integral depends in general on the direction 
of scattering. The probability of finding the scattered particle with wave 
vector in the range k + bk and in the solid angle 8£2 = sin 05550 after 
timer' is (8.18) 



Jltfty) 

5k 



\ 2 k 2 dk da. 



(8.42) 



Since E k = fi 2 k 2 /2fx then (8.20) this can be written 

(\h ki \ 2 (jJc2n/h 3 )8nt'. (8.43) 

Hence the number of particles scattered into the solid angle 6 O, per unit 
time is ((8.41), (8.43)) 

(WlKx" 1 "^ 

and so the differential cross-section is 

k 2 , 



a(6, 0) = 



2i[% 4 



V(r)e' 



,1'K.r 



dr 



(8.44) 



This is the result of the first Born approximation and will be valid if 
the interaction potential V(r) is small compared with the kinetic energy 
of the incident particles. 




Figure 8.2 
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The simplest scattering potential is that of the spherical well 

V(j) = -V r<a 

= r>a. 

In this case the polar axis can be conveniently taken along the 
momentum transfer direction K. 

JF(r)e ,K - ,[ dT= V J j J e iKrcosli r 2 sinpdy d0dr 



4jt 



= - — K Jsin(tfr)rdr 



= + 4 4v« 



a 1 

— cos Ka — — j sin Ka 

K K 



K can be expressed in terms of the scattering angle 

K = 2Ar sin 0/2. 
The differential scattering cross-section is 



ff (") = 1 h 2 



q 3 \ 2 / sin u - u cos u 



where 



u = Ka = 2k Q a sin 



6 




Figure 8.3 
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g(«) 




4 6 

u (radians) 

Figure 8.4 Differential scattering cross-section for a spherical potential-well. 

This extremely brief treatment of the theory of collisions has been 
given primarily to illustrate the time-dependent perturbation theory. 
The reader is referred to Burhop's review article 'Theory of Collisions' 
in Quantum Theory, Elements, edited by D. R. Bates, for a fuller 
treatment. 



PROBLEMS 

1 In the laboratory co-ordinate system the stationary target mass m 2 is 
initially at and the projected mass m t has a velocity (0, 0, V). If after 
the collision, assumed elastic, the particles are scattered through 1 and 

2 with velocities Vj and V 2 (see Fig. 8.5) show that 

m 1 V 2 =m 1 Vi + m 2 V\ 

mi V = m 1 V l cos0! +m 2 V 2 cos0 2 

m\V x sin0j =m 2 V 2 sin0 2 - 



_^l?i 



"«"--vfr 



z-axis 




Figure 8.5 
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Show also that the centre of mass moves with a velocity (0, 0, V ) where 
V =m 1 V/(m 1 +m 2 ). 

In the centre of mass system the total linear momentum is zero and 
the momenta of the particles are the same in magnitude but opposite in 



r 



/ 
/ 



u 



/OT 2 



/ 
/ 
/ 
/ 
/ 
/ 



Figure 8.6 



sign. In Fig. 8.6, C is the fixed centre of the mass, u 1; U2 are the initial 
velocities and Vi , V2 are the final velocities. Explain why for elastic 
collisions V[ - u x and V 2 = u 2 and show that 

Vi sin 9 1 =Uy sin 6 

V l cos0 1 =«i cos 6 + V . 

Hence show that 

m 2 sin 8 , , 

tan0!= and d 2 ={(*-0). 

m l +m 2 cos 

(Hint: observe that Vj is the resultant of V J and the centre of mass 
velocity (0, 0, V ).) 

2 Let a and a be the differential cross-sections in the laboratory and 
centre of mass co-ordinate systems respectively. Then 

a(0,)dft = o'(0)dft' 

where dCl and dSl' are the corresponding respective solid angles. Show 
that (see problem one) 

(1 +ucos0) 

sin^dfl^- — V - ^ 7TV2- sin0d0 

1 (1 +ju 2 +2jucos0) 3/2 
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where n = m 1 lm 2 and hence show that 

(l+M 2 +2 M cosfl) 3/2 
(j(0 1) = a (6) ; — . 

v u w |1 +JUCOS0I 

3 Maxwell's equations for the electromagnetic field are 

dB 3D 

curl <£■=-— - curlH=j+ — 

at at 

div D = p div B = 

with 

H = e r e S B = MrMoH 

where e r , n r are the relative permittivity and permeability of the medium. 
Show that the electric and magnetic induction vectors can be expressed 
in terms of the scalar and vector potentials <j> and A such that 

9A 
S = — —-- grad #, B = curl A (i) 
at 



with 

at 2 e r e 



v a, _«&**.__£_ (ii) 



and 

V 2 A - '- _ 

c 2 dt 



_, \i r e r 9 2 A . „.. N 

V 2 A - — y 7T = -MrMoJ (in) 



provided 

M r e, 9$ 
div A = — — 5- — . 
c 2 dt 

More than one set of potentials specify the fields. Show that equations 
(i), (ii) and (iii), are invariant under a change to the new potentials 

A' = A - grad x, 4>' = <P + "gT 



provided 

This is called a gauge transformation. 



c 2 dt 2 
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4 (i) Prove the vector identity 

div (<f A H) = H . curl € - & . curl H. 

(ii) Write down Maxwell's equations for free space (problem 3) and show 
that 

H . curl S- S. curl H = - — (hlioH 2 + \e <? 2 ). 
at 

The energy density of the electromagnetic field is 

\noH 2 +^e S 2 . 

Use the divergence theorem to show that the flux of energy out of a 
closed surface s is 



>N.nds 
s 

where n is the unit outward normal and N is the Poynting vector 

N=«?AH. 

5 The commutator for two quantum operators p, A is related to the 
corresponding Poisson bracket of the two dynamical variables p, A by 
(3.6) 

[p,A]=/ft{p,A}. 

Show that if p is the momentum operator and A is a function of the 
position only then 

[p,A] = -# div A. (i) 

(Hint: use equation (1.28).) 

The classical Hamiltonian for an electron in an electromagnetic field is 

2 2 

H = — + — (p.A + A.v) + ^—A 2 -e<j>. 
2m 2m VF v ' 2m 

(Chapter 3, problem 2). Explain how equation (i) can be used to obtain 
the correct quantum Hamiltonian of (8.24). 
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CHAPTER 9 



Group Theory 



9.1 Symmetry Operations 

Many physical systems possess symmetry properties which have 
important consequences for the occurrence of energy levels. For 
example, crystalline solids have a periodic structure and this translational 
symmetry enables the energy eigenfunctions to be classified by a wave 
vector k. These functions are the Bloch waves and will be referred to 
again later in the chapter. 

Although finite molecules cannot possess a periodic structure they 
can have other symmetry properties, such as reflection planes and 
rotation axes. 



Figure 9.1 Plane of symmetry. 
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A plane of symmetry is a plane in the system such that each point on 
one side of the plane is the mirror image of a corresponding point on 
the other. 



Mirror 
plane 



-xf 



Inversion 
axis 



-x/>' 



Figure 9.2 

An axis of symmetry is an axis such that the system takes up an 
identical position on rotation through a suitable angle about it. The 
angle of rotation must be of the form 2tt/« where n is an integer and 
such an axis is called an w-fold axis. 

A centre of inversion is a point such that the system is invariant 
under the operation r -* — r where r is the vector displacement of any 
point in the system referred to the inversion centre. 

A system possesses a rotation-inversion axis if it is brought into self- 
coincidence by a rotation followed by an inversion. 

Some symmetry operations are equivalent. For example, a two-fold 
inversion axis is equivalent to a mirror plane perpendicular to the axis 
(Fig. 9.2). 



Symmetry operators 

Mathematically, a symmetry operation such as a rotation or reflection 
involves a linear transformation of the co-ordinates. There are two 
distinct ways of thinking about such a rotation. A physical rotation of 
a body through an angle a in the clockwise direction about an axis, say 
the x 3 axis, is equivalent to a rotation of the complete axes system 
through the same angle but in the anti-clockwise sense in that the 
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relation between the new and old co-ordinates of any point in the body 
is the same in both cases (Fig. 9.3). In each case 



*i =Xi cosa—x 2 sin a 

x 2 =Xi sin a + x' 2 cos a 
_ / 

*3 ~ x 3- 



(9.1) 



where x t , x 2 , x 3 and x u x 2 , x' 3 represent the old and new co-ordinates 
respectively. The first point of view in which the system is imagined to 
undergo an actual rotation is called the 'active' viewpoint and is perhaps 
the easiest to interpret. However, mathematically it is far more convenient 
to use the 'passive' viewpoint of axes rotation and this will be done here. 



.P(Xl,Xl,xJ 



.P'(x'i,x' 2 ,x' 3 ) 



I 



I / 



' Y 



/ 



/* 



(a) 

Figure 9.3 (a) Rotation of P to P 1 in a clockwise sense, 
(b) Rotation of axis in anti-clockwise sense. 




A general rotation, reflection or inversion can be represented by a 
linear relationship of the form, 

*'l =<*11*1 +<*12*2 +ai3*3 
X 2 =a 2 l^l +<*22*2 +a 2 3^3 



x 3 =a 31 xi +a 32 x 2 +0:33X3 



(9.2) 



where Xi,x 2 ,x 3 are the old co-ordinates of a point and x\,x ' 2 ,x' 3 are 
the new co-ordinates of the same point. Equations (9.2) can be 
represented by the simple matrix equation 



: ax 



(9.3) 
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where a is the matrix 

<*21 <*22 a 23 
«31 a 32 a 33y 

and x and x' are the column vectors 

'"\ * ('■ 

x 2 I and I x 2 
K x 3 J \x' 3j 

respectively. 

For an inversion, a rotation about an axis, or a reflection in a plane 
through the origin, the distance of a point from the origin remains 
constant. 

r*=*5+*l+xl =*?+*?+*?• (9.4) 

The matrix o is an orthogonal matrix and its determinant takes the 
value ±1 , the negative sign belonging to an operation involving an odd 
number of reflections. 

The effect of any operator on a function /(x) is to transform it into 
another function. For example the differential operator d/dx changes 
x" into nx"' 1 . It is now necessary to define the effect of a symmetry 
operator A on an arbitrary function. In the passive convention, the co- 
ordinate axes are rotated and the value of the function at each point in 
space remains unchanged. The transformed function is easily obtained 
by replacing the old co-ordinates by the new co-ordinates according to 
(9.3), i.e. 

/l/(x) = /(a- 1 x') (9.5) 

where a is the transformation matrix corresponding to the operator A. 
It is convenient to drop the dash and 

^/(x)=/(a" 1 x). (9.6) 

The operator is a substitutional one and is linear. 
As an example consider the two functions 

di (x, y) = 2xyf(r), d 2 (x, y) = (x 2 - y 2 )/(r). (9.7) 

The operator representing a rotation of the axes through ff/4 radius in the 
anti-clockwise about the z-axis may be written 8 Z . The old co-ordinates 
are given in terms of the new by (9.1) 

x=—j=.{x-y\ y=-y=(x+y), z = z 
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a = 



V2 




8 z d 1 (x,y)=2x± (x' - /)(*' +/)/(/) 

=<y 2 -/W) 

= d 2 {x',y') 



(9.8) 



The function d x evaluated at the point (x, y) is equal to the value of d 2 
at the same point with new co-ordinates (x',y'). 



9.2 Groups 

A group is a set of mathematical elements {^4} which obey certain 
specified relations. In this book the elements will be taken to be sym- 
metry operators but the concept is much more general. The members 
of a group have the following properties. 



/>:: 



P" 
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Figure 9.4 
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f. Closure property 

tt must be possible to combine the elements in pairs to form a new 
dement of the group. That is, the group is closed with respect to some 
[binary) operation, e.g. 

A=BC. (9.9) 

In this book the group elements are transformation operators and 
(9.9) means that if the operator C is applied first, and B second, the 
result is equivalent to applying the operator A alone, where A is another 
group element. In general the order of the operators is most important. 
Consider a point P on the y-axis together with a plane of symmetry m x 
in the yz plane and a four-fold rotation axis lying along the z-axis. 
If the operator m x is applied first, followed by an active-clockwise 
rotation 4z, the point P moves to P'. However if the order of the 
operations is reversed, P moves to P". The two operations do not 
commute. 

However, any two rotations about the same axis do commute. 

2. Associative property 

The group operation is associative and the combination of three 
operators is uniquely defined, i.e. 

A(BC) = (AB)C = ABC. (9.10) 

3. Identity 

One of the group elements is the identity or unit element E such that it 
commutes with all other members of the group, i.e. 

EA=AE = A. (9.11) 



4. Inverse 

Each element A in a group has an inverse A _I so that the combination 
of an operator with its inverse, results in the unit operator, i.e. 

AA' 1 =A~ 1 A=E. (9.12) 

The group 3m 

Consider a right pyramid with an equilateral triangular base and its 
central vertex above the centroid of the base. The pyramid is not a 
regular tetrahedron so that the three faces meeting at the central vertex 
are isosceles and not congruent to the base. This is the structure exhibited 
by the ammonia molecule NH 3 with the three hydrogen atoms situated at 
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The origin is at the 
centre of the triangle. 



Figure 9 .5 The ammonia molecule. 



the vertices of the equilateral triangle and the nitrogen atom at the 
remaining vertex. The group 3m consists of those symmetry elements 
which leave this pyramid invariant. This is also the symmetry group of ar 
equilateral triangle if the two plane sides are not identical. 

There are six symmetry operations which leave this system invariant. 
They are, 

(a) the identity operation E. 

(b) the three mirror planes of the type m v 

(c) the two rotations 3 Z , 3f corresponding to an active clockwise 
rotation of 120° and 240° about the z-axis respectively. 

These six operations may be shown to form a group, and Table 9.1 is 
the 'multiplication table' for the group. The fact that such a table can be 
constructed shows that the elements must form a group. 

Table 9.1 Multiplication table for the group 3m. 
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In Table 9.1 the elements in the top row are applied first and then the 
operation in the first column. For example, if the reflection m x is appliei 
and then followed by the rotation 3 2 , the result is equivalent to the 
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reflection m 2 - However, if 3 Z is applied first and then m x , the result is 
m 3 . This illustrates a point already mentioned that the order of the 
operations is important, as they do not necessarily commute. 



The group 6m2 

Suppose in the previous example the two faces of the triangle were 
identical. The system would then be invariant under twelve symmetry 
operations constituting the group 6m2. Apart from the six operations 
already discussed this group includes, 

(d) the three rotations of the type 2 y through 180° about the vectors 
rj, r 2 and r 3 respectively. 

(e) the mirror plane m z perpendicular to the z-axis and including the 
triangle. 

(f) the two operations 6 Z involving a rotation through 60° and 300° 
respectively, about the z-axis followed by an inversion through the 
origin. 

It is left to the reader to construct a multiplication table for this 
group. 

This symmetry group is of importance in that it is related to that of 
an ion surrounded by three molecules of water in the hydrated crystal 
of a salt. The ozone molecule has almost this symmetry. 

9.3 Matrix Representation 

The reader was introduced to the concept of a function space in Chapter 
3 (definition 6). A function space {0,-} is said to be invariant under the 
set of operators {.4} if all transformed functions of the type Adj belong 
to the space. The Cartesian vector space, defined by the unit vectors 
{ej, is invariant under either of the two symmetry groups 3m or 6m2. 

If the space {0,-} is invariant under the group {A} the transformed 
functions may be expressed in terms of the original set, i.e. 

A6 i = Za mt 8 m . (9.13) 

m 

In matrix notation Ad = 8a where 8 is a row vector. The matrix a 
represents the effect of the operator .4 on the basis functions {0,-} and 
each operator in the set has a matrix representation. If B is another 
operator in the set then 

56 = 611 (9.14) 

The effect of the multiple operator AB is easily seen to be 

ABe^IZajnPmid,- (9.15) 

/ m 
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if the operators are linear, i.e. 

ABQ=Qa$. (9.16) 

The matrix corresponding to the product AB is the product a|3of the 
separate matrices representing A and B. 

The matrices obey the same group multiplication table as the | 

operators themselves and are said to form a representation of the group. 
The order of the matrices gives the dimensionality of the representation. 

The set of functions {0,} can be used to define the same vector space 
as {dj} if the number in each set is the same and if the new basis functions 
can be expressed in terms of the old by some linear transformation, i.e. 

»/ s 2V'- ( 9 - 17 ) 

In matrix notation, 

e' = 6S (9.18) 

where 8' and 8 are row vectors. Conversely 

8=6'S _1 . (9.19) 

The representation of the operator A in the new basis is given by 

AQ' =A9S 
= 8aS 
= 8'S _1 aS. 
That is, the new basis functions transforms according to the representation 

S _1 ctS. (9.20) 

This new representation obtained by this similarity transformation is said 
to be equivalent to the original representation a. A change of basis 
functions, as described above, is equivalent to choosing a new set of 
vectors {e,-} in Cartesian space related to the old set {e,} by some rotation. 

The matrices in equivalent representations corresponding to a given 
operator are not in general identical but their traces are equal (6.22). As 
the trace of a product of two matrices does not depend upon the order 
of the factors 

Trace (S _1 aS) = Trace (SS _1 et) = Trace (a). (9.21) 

Irreducible representations 

The function space {0,-} is said to be irreducible under the set of operators 
{A} if with any function x» in the space, the set of transformed functions 
{/4xi} defines the same space where A runs over all the members of {A} 
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in turn. The Cartesian space {e,} is irreducible under the set of three 
operators consisting of a rotation of 90° about the x-axis, and 90° about 
j'-axis and 90° about the z-axis. 

On the other hand, this Cartesian space is not irreducible under the 
set of operators consisting of the identity and rotations of ± 120° about 
the (111) axis. To see this, consider the unit vector along the (1 1 1) axis. 

A set of matrices representing a group of transformation operators 
with respect to a set of basis functions {0,-} is said to form an irreducible 
representation, if the function space defined by {0,} is irreducible under 
the group of operators. A reducible representation can always be brought 
into irreducible form by an equivalence transformation of the type 
(9.20). 

The reducible matrices representing all the elements of the group are 
then reduced to block form, all with the same block structure. That is, 
each matrix is a direct sum of the square matrices a lt a 2 , . . . along the 
leading diagonal and the dimensions of corresponding submatrices are 
the same for every operator in the group, e.g. 



*i 











a 2 











a 2 



A set of submatrices forms one of the constituent irreducible 
representations of the group. 

The functions xe~ r , ye~ r , ze~ r correspond to a reducible represen- 
tation of the equilateral triangle group 3m (Table 9.1 and Fig. 9.5). 

The matrices representing the different group operators are given in 
Table 9.2. 

As an example consider the effect of applying the operator w t 
representing the mirror plane perpendicular to the #-axis, 



-10 
m l {xe-~ r ,ye- r ,ze- r ) = {xe- r ,ye- r ,ze~ r )\ 1 



1 



(9.22) 



Inspection of the Table 9.2 shows that none of the six group elements 
mixes the functions xe~ r , ye~ r withze _r and the two sets (xe~ r ,ye~ r ) 
and (ze~ r ) span separate spaces which are invariant under the group 3m. 
There is no need to carry out a similarity transformation in this simple 
case. The two separate irreducible representations are listed under A 
andiT. 
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The letters most commonly used to represent a one-dimensional 
representation are A and B, E is used to represent a two-dimensional 
representation and Tto represent a three-dimensional representation. 

Another basis for the two-dimensional irreducible representation of 
the group 3m is (x + iy) e~ r , (x - iy)e~ r . These functions are a linear 
combination of the previous basis functions and so produce an equivalent 
representations of the group. The transformation matrix representing the 
linear relationship between the two sets of basis functions is 



H' -.') ™ ds " , - i (! 1} 



The matrices of the new equivalent representation are displayed in the 
last row of Table 9.2, e.g. for 3 2 




Every group has an identity representation, such as A, in which all 
the 'matrices' are unity. This is not a 'faithful' representation since the 
'matrices' obey more relations than the group elements themselves. 

Apart from the two irreducible representations already mentioned 
the group 3m has another one-dimensional representation. 

Table 9.3 Character table for the group 3m. 



3m 


E 


m x 


m 2 


m 3 


3 Z 


3 Z * 


^i 


1 


1 


1 


1 


1 


1 


A 1 


1 


-1 


-1 


-1 


1 


1 


E 


2 











-1 


-1 



The trace of a matrix representing an operator is called the 'character' 
of the operator in the representation and is independent of the choice of 
basis functions. 

Table 9.3 is a character table for the group 3m and lists all three 
irreducible representations. 

Classes 

The elements of a group may be partitioned into exclusive sets called 
'classes'. Given any element A of a group, the set of elements B~ l AB, 
where B runs over all elements in the group, is called a class. 

The repeated operation B~ l AB is the result obtained by first rotating 
the system to some equivalent position by the rotation B, next carrying 
out the symmetry operation/!, and then reversing the initial operation 
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by B~ l . The members of a particular class must be operators of the same 
type, such as rotations through the same angle, but performed about 
different axes which are related to each other by members of the group. 

As the characters of all the elements of a class in a given representation 
are identical (9.21), it is only necessary to specify the classes in a group 
character table. 

The six elements of the group 3m (Table 9.3) are divided in three 
exclusive classes, 
E 
m u m 2 , m 3 

3 Z .3 Z *. 
The identity operator always forms a class of its own. 

It can be shown that the number of classes in a group is equal to the 
number of irreducible representations of the group. For the equilateral 
triangle group there are three classes and three irreducible representations. 

There is a very useful relationship between the dimensionalities of the 
irreducible representations and the number TV of elements in the group. 
If the /th irreducible representation has dimension «,■ then 

2nf =N, (9.23) 

i 

where the sum is over all the possible irreducible representations. 
Consequently if the number of elements and classes in a group is known 
then (9.23) will give the dimensionality of the irreducible representations. 

The group 3m has three classes and hence three irreducible represen- 
tations. As the group has six elements then (9.23) becomes 

l 2 +1 2 +2 2 =6. 

There are two one-dimensional and one two-dimensional irreducible 
representations in agreement with Table 9.3. 



Axial rotation group 

The axial rotation group is the infinite group whose elements are all the 
possible rotations about some axis. Clearly a rotation through an angle 
0i followed by 2 > is exactly equivalent to <t> 2 followed by 4>\- Such a 
group in which all the elements commute with one another is called an 
Abelian group and each element forms a class in itself. As the number 
of irreducible representations is equal to the number of elements, (9.23) 
implies that all these representations are one-dimensional. 
If r(0O is the number representing the rotation <p t 

r(0i)r(0 2 ) = r(0 1 +^ 2 ). 

This equation has solutions 

r (m) (</>) = e' m0 . (9-24) 
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As a rotation through 2n is equivalent to zero rotation, if the function is 
single valued, 

; r( m >(27r) = r< m >(0) = 1 

and m is restricted to the values 

wi=0,±l,±2, .... 

A suitable choice for basis functions is 

^m(r,e,0) = f(r > e)e'"" t> . (9.25) 

9.4 Symmetry and Quantum Mechanics 

Many molecular systems and solids have symmetry properties which have 
important consequences for the solutions of Schrodinger's equation and 
are related to the occurrence of degeneracies. For the moment only one- 
electron systems will be considered and the Hamiltonian for a particle 
of mass m is, 

h 2 
Jf(x u x 2 , x 3 ) = - — V 2 + V(x lt x 2 , x 3 ). (9.26) 

2m 

If the old co-ordinates x it x 2 , x 3 are replaced by new co-ordinates 
x i, x' 2 , x' 3 (9.2), by some linear transformation, it is well known that the 
Laplacian is invariant. 

a 2 a 2 a 2 _ a 2 a 2 a 2 

3x 2 + bx 2 2 + 3x| ~ 3x1 2 + *x' 2 2 + 3x 3 2 ' (9 } 

Consider now a 'symmetry operation' that brings the system into self- 
coincidence. The potential function is invariant under the change in 
co-ordinates corresponding to this operation. 

As a simple illustration consider the potential of an electron moving 
in the field of two protons. This arises in the hydrogen molecular ion. 

A rotation through 180° about any axis perpendicular to and bisecting 
the line joining the two protons brings the system into self-coincidence. 
In particular the point P is rotated into P' and clearly the potential 
function has the same value at P and P', i.e. 

V(x 1 ,x 2 ,x 3 )=V(x' u x' 2 ,x 3 ) (9.28) 

where the co-ordinates (xi,x 2 ,x 3 )ofPare related to those of P' 
(xi,x 2 ,X3 by a linear transformation of the form (9.2). When the 
transformation is interpreted passively as a rotation of co-ordinate axes 
instead of as a physical rotation, (9.28) requires that the potential 
function be invariant under the transformation. 

The complete Hamiltonian is invariant under a linear transformation 
corresponding to a symmetry operation. 
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P(x l ,x 2 ,x 3 ) 



--X 

+ e 



P'(X' U X'2,X\) 

Figure 9.6 Hydrogen molecular ion. 

Suppose the co-ordinates undergo an arbitrary linear transformation 
corresponding to the operator A. If \K*i > x 2 , x 3 ) is any wave-function 

Ajf(x u X 2 ,X 3 )\lj(x u X2,X 3 )=J^Xxi,X2,x' 3 )\l/'(x'ux'2,x'3) 

= J^'(x' 1 ,X2,x' 3 )A^(XuX2,X3) 

(9.29) 
where •#" and \p ' are the transformed Hamiltonian and wave-function in 
terms of the new co-ordinates. If A is a symmetry operator 2%" is the 
same operator as .?f and (9.29) implies that the Hamiltonian commutes 
with A 

The important relationships between commuting operators and 
degeneracy was mentioned in Chapter 5 . If \p is an eigenfunction of Ji? 

Jtijj=Eip. (9.30) 

Transform this equation by operating on the left by the symmetry 
operator A 

A(3f\p) = Jtf(A\p) = E(A>l>). (9.31) 

In general the transformed wave-function 

\l/'(x' 1 ,x'2,x 3 )=A\ls(x 1 ,x 2 ,x 3 ) (9.32) 

is not the same as \p(x 1 ,x 2 , x 3 ) and they are two linearly independent 
wave-functions denoting degenerate states belonging to the same energy. 

A simple illustration is obtained by considering the eigenfunctions 
for a particle in box. From equations (4.42) and (4.43) the three eigen- 
functions 

W ,l,2) = sin 7 ^ + -|sin-^ + -jsin 7 ^ +I j 

■g I a\ 2ir a\ n ( a 
*(l,2,l)-in-^ + -j a „_^ + -j IB1 -^ + - 

^(2,l > l) = sin^(, + |)sin^^ + |]sin^z + f) (9.33) 
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are degenerate and have the same energy, 

E=6h 2 /8ma 2 . (9.34) 

In (9.33) the origin is taken to be at the centre of the box. \j/(l ,1,2) can 
be transformed into <p(l, 2, 1) or \p (2, 1, 1) by considering the symmetry 
rotations of 90° about the x and y axes. 

In general, each energy level possesses a set of degenerate eigen- 
functions which span a space which is invariant under the action of any 
group of transformation operators which commute with the Hamiltonian. 
Consequently the eigenfunctions produce some representation of the 
group. 

The important relationship between group theory and quantum 
mechanics is the postulate that the set of all degenerate wave-functions 
belonging to a given energy level form an irreducible representation of 
the group of all the symmetry operators of the Hamiltonian. 

As a general rule, this statement is correct, but sometimes not all 
degenerate eigenfunctions are generated by applying the symmetry 
operators to a given eigenfunction. These extra degeneracies are called 
accidental. A well-known example that has already been referred to is 
the case of the hydrogen atom (Chapter 4). 

The degeneracy between states with the same n and / quantum 
numbers is related to the spherical symmetry of the hydrogen atom and 
is discussed later in this chapter. However, the Coulomb degeneracy 
between the states with the same n quantum number but with different 
values for/, does not arise in this way although Fock (1935) has related 
this degeneracy to the symmetry of the Schrodinger equation in 
momentum space. Often accidental degeneracies can be explained by 
deeper investigation. 

9.5 Representative Operators and Symmetry Operators 

A question that springs to mind is 'what relation is there between the 
operators representing the observables and the symmetry operators of a 
system?'. In Chapter 5 the reader was introduced to the concept of a 
complete set of commuting observables. When the eigenvalues of each is 
specified, the wave-function is known completely, apart from a phase 
factor. In the present chapter it has been stated that all degeneracy, apart 
from accidental degeneracy, can be explained by the symmetry operators 
that commute with the Hamiltonian. Consequently there must be a 
correspondence between representative operators and symmetry 
operators. 

The simplest example is the relation between translational symmetry 
and linear momentum. The Hamiltonian for a single electron in a fieldless 
space is 

h 1 
2m 
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and the energy eigenfunctions are 

<//k(r)=e'' k - r . 
These are also eigenfunctions of the momentum operator 

z 

with eigenvalues ftk. 

Consider the one-dimensional case. A translational operator ^(5*) 
may be defined by 

P(8x)i// (x) = iK* + 8x). (9.35) 

An infinitesimal translation operator I x can be defined by 

iIx s hm E&LlL (9 . 36) 

Sx-*0 OX 

so that 

4,(pc+8x)-^(pc) _ dip 

il x \p(x) = hm — . (9.37) 

Sx-+o ox ax 

From the last equation it is possible to deduce the equivalence 

i ax 

i.e. 

p x =hl x (9.38) 

where p x is the linear momentum operator. This is a particular case of 
the relation 

P=M q (9.39) 

where p is the momentum operator conjugate to q. In general a represen- 
tative operator that commutes with the Hamiltonian is related to a 
symmetry operator of the system. 



9.6 The Full Rotation Group 

The rotational symmetry of the hydrogen atom is intimately related to 
the angular momentum eigenfunctions discussed in Chapter 4. 

Let P((j), z) be an operator that rotates the co-ordinate axes by an 
angle (j> about the z-axis in the positive sense. An infinitesimal rotation 
operator may be defined by 

fl*>=Mm f(5 ^ Z >- 1 . (9.40) 
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Since, in spherical polar co-ordinates 

P(50, zMr, 0, 0) = Hr, 0, + 60) 
then 

tf* ,} *=|£. (9.41) 

00 

The z-component of angular momentum can be written 

^ z ~ i 30 

and so 

S^=nii r \ (9.42) 

This final equation is a special case of (9.39) and shows how axial 
rotational symmetry is related to angular momentum about the axis. 

Consider a diatomic molecule and choose the z-axis to lie along the 
line joining the nuclei. The rotational symmetry of the molecule implies 
that the rotational operator /| r) and hence the angular momentum 
operator JS? Z commute with the Hamiltonian. The z-component of angular 
momentum is a constant of the motion and the eigenfunctions take the 
form (9.25) with eigenvalues mh. The states with \m | = 0, 1, 2, . . . are 
denoted by 2, II, A, . . .. 

Similarly to (9 .42) 

SC x =tiI^ and 2 y =Wtp. (9.43) 

Clearly, the infinitesimal rotation operators must obey the commu- 
tation relations similar to those for angular momentum operators, e.g. 

/«/<'> - #>#> s flf>. (9.44) 

This result can be obtained without reference to the angular momentum 
operators. 

Infinitesimal rotation operators (but not finite rotations) add like 
vectors and if ffi is an infinitesimal rotation about the a-axis 

4 r) = UP + in/W + nl^ (9.45) 

where /, m, n are the direction cosines of the a-axis. This is related to the 
vector properties of angular momentum. 

From (9.40) it can be seen that a finite rotation through an angle 80 
about the z-axis can be expressed by 

P(60, z) = 1 + /50/ z (r) + eS0 
where e -* as 60 -* 0. 
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The effect of the infinitesimal rotation operators on the co-ordinates 
of a point can be represented in matrix form. Consider a rotation of axes 
through 50 about the z-axis in the anti-clockwise sense. From (9.3) and 
(9.1) 

x'\ i x \ / cos 50 sin 80 

y J=.P(50,z)| y 1=1 — sin 50 cos 50 

K z) \z) \ 1 

The operator P(&<t>, z) is represented by the matrix 

cos 50 sin 50 6\ /l 0\ / 

-sin 50 cos 50 ] = | 1 1+1-1 








1 




1 o\ 

1 50 + e60 



1 



0/ 



where e-»-0 as 50-^0 and so the infinitesimal rotation operator IP is 
represented by the matrix 

r -1 0' 

1 

K 0, 

Similarly /£ r) and ffi are represented by 

'0 o\ / 

i( -1 J and ij 

,0 1 0/ \-l 

respectively. These three matrices do satisfy the commutation relations 
(9.44). 

Any rotation about an axis can be expressed in terms of the corre- 
sponding infinitesimal rotation operator. Consider a rotation through 
the angle <j>' about the z-axis. If the wave function is analytic then by 
Taylor's theorem 

P((p',z)Mr, 0,4>)=Hr. 0,0 + *') 

-*(r.9.«+2 ^ 30" ■ 
n=l 
Since 



then 



30 z 

P(0',z>KM,0) = 



+ 2>' 7 ^" 



« = 1 



Hr,S,<t>). 
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This can formally be written 

i»(*',z) = exp(/07i , >) (9.46a) 

or in the general case for a rotation through about the a-axis 

P(e >a )=exp(idl^). (9.46b) 

This is an important result and together with (9.45) shows that any 
rotation can be expressed in terms of the three infinitesimal rotation 
operators /< r >, /jf) and /<'>. 

The full rotation group is the infinite set of all proper rotations about 
all axes through a fixed point. If a function space is invariant under /J r \ 
Iy r ^,Iz then it is also invariant under the full rotation group. The 
converse is also true. 

Angular momentum and irreducible representation 

In the next chapter spin momentum operators are defined through 
appropriate infinitesimal rotation operators and it follows that the spin 
operators satisfy the same commutation rules as those for angular 
momentum. It is worthwhile at this point to reconsider the derivatio'n of 
the allowed eigenvalues for angular momentum by an argument based 
solely on the commutation relations. Essentially this is the problem of 
finding the irreducible representations for the rotation group. (See Heine, 
p. 55, for a direct method of finding the representations.) 

Since the operators & 2 and JS? Z commute, there exists a set of 
functions which are simultaneously eigenfunctions of both these 
operators and any other operators that are required to form a complete 
set. Let ipm be the unique function that satisfies the eigenvalue 
equations 

.§? 2 «// m =a 2 h 2 ^ m 

& z * m =mh^ m (9.47) 

and is also an eigenfunction with given eigenvalues of the remaining 
operators in the complete set. If the system possesses spherical sym- 
metry these operators (apart from JSf z ) will also commute with Sf x and 
JPy . This will be assumed and it is shown below how it is possible to 
generate from t// m all the simultaneous eigenfunctions of the complete 
set that differ only in their eigenvalue of 5£ z . Each of these functions is 
the basis for a one-dimensional irreducible representation ofI$ r \ e.g. 

I { z r) +m =«*«• (9-48) 

It is convenient to introduce two 'shift' (or 'ladder') operators J2P ± 
defined by 

jgf + = js? x + 1 se y se_ = & x - 1 & y . (9.49) 

These operators satisfy the commutation rules 

&, se± - <e± & z = ft se ± . (9 .50) 
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Consider the function HP+^m- Simple algebra gives 

and 

All other operators in the complete set commute with jSf+ and so this 
shift operator generates from \p m a sequence of simultaneous eigen- 
furictions that differ only in their z-component of angular momentum. 
The eigenvalues are 

mh,(m + \)ti,(m+2)ti,.... (9.51) 

Similarly, J5P_ generates from \jj m a sequence of simultaneous eigen- 
function with decreasing z-component of angular momentum 

{m-\)h,(m-2)fi (9.52) 

The number of eigenfunctions that can be generated in this way is 
finite. Any physical system will have a finite value for total angular 
momentum. That is, a 2 is finite. Also, since 

(J2* +#*)*„ = (^ 2 -^?)* m 

= (a 2 -m 2 )h 2 ^ m (9.53) 

then 

a 2 - m 2 > 

as <£\ + &l is a positive definite operator and must have positive 
eigenvalues. The eigenvalues of <£ z are bounded both above and below 
and let l+h and /_ft be the largest and smallest eigenvalues with eigen- 
functions i/// + and i///_ respectively. 
Since l+h is the largest eigenvalue 

&^ U = Q (9-54) 

and similarly 

J2?_lh_-0. (9-55) 

From the commutation result (it is easy to verify it) 

& ± 2+ = & 2 - &z{&z + fi) (9.56) 



then 



and 



jsr- <e+ K = o = [a 2 -n (/+ + 1 )] h 2 *, t 

a 2 =Z + (/ + + !) = /_(/_-!) 
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i.e. 

(/++/_)(/+ -/_+l) = 0. 

Clearly (/+-/_ + 1) >0 as /+ and /_ are the largest and smallest 
quantum numbers and so 

/+ = -/_= /(say) 
and 

a 2 =1(1 + 1). (9.57) 

Since each step in the shift operation is by a unit of h then 

/+-/_= 2/ (9.58) 

is either zero or a positive integer. 

There are (21 + 1) functions which are simultaneously eigenfunctions 
of the complete set of operators i ncludin g JS? 2 and & z which have total 
angular momentum eigenvalue V/(/ + 1 )h. The functions have eigenvalues 

Ih, (1 - l)h, ...,-m (9.59) 

for <£ z . 

The above analysis shows that this set of (21 + 1) functions span a 
space that is irreducible under the set of operators J? z , <£?+ and jSf_ 
and hence under S£ 2 , S£ x , ££ y . From (9.42) and (9.43) this space is 
irreducible under the infinitesimal rotation operators and hence under 
the full rotation group. The irreducible representations may be denoted 
by Z)W where the dimensionality (21' + 1) is a positive integer. 

The argument has been based solely on the commutation relations 
satisfied by the angular momentum operators and is directly applicable 
to the spin and total angular momentum operators to be met in the 
succeeding chapters. 

The one-electron central field problem 

The central field Hamiltonian (5.23) is invariant under the full rotation 
group and so commutes with the angular momentum operators. Its 
eigenfunction must belong to one or other of the irreducible 
representations. /)('). These eigenfunctions are from (5.41) 

t nlmi (r,e,<t>) = R n i(r)Pr i (cos 6)e ,m r» 

and are clearly reduced with respect to rotations about the z-axis 

By using SC+ and JS?_ it is possible to generate the degenerate set of 
eigenfunctions 

*,*«! «!« + /, +(/-!) -/• ( 96 °) 

This set forms the basis for the irreducible representation with 
dimension (21 + 1) 
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The s-functions (/ = 0) transform according to Z)(°), the p-functions 
transform like D ( ! > and the <i-f unctions like D < 2 > . 

The degeneracy of the coulomb potential wave-functions with the 
same principle quantum number but different / quantum numbers is not 
explained by the rotational symmetry of the atom in three-dimensional 
space. 

In the above, / is required to be an integer as the wave-functions must 
be single-valued and so the irreducible representations have odd dimen- 
sion. However, representations of even dimension do exist. In particular, 
the representation Z>< 1/2 > of dimension two will be used in the discussion 
of electron spin in the next chapter. 

9.7 Electron in a Periodic Field 

The lattice periodicity of crystals allows the energy eigenstates to be 
classified by a real wave vector k and the eigenfunctions are called 
Bloch functions. The essentials of the problem are contained in the 
one-dimensional case considered below. 

The Schrodinger equation for a line of identical atoms, distance 'a' 
apart, is 

ft 2 d 2 \J/ 

■Y+V(x)1/=E\I/ (9.61) 



2m dor 

with 

V{x - a) = V(x). 

The substitutional operator representing the translation na is written 
{e | na) with 

{e \na) V(x) = V(x - na). (9 .62) 

If the line of atoms is infinitely long the number of lattice translation; 
is infinite. To keep the number of distinct translations finite it is usual 
to introduce periodic boundary conditions. The finite line is divided up 
into identical 'micro-crystals' each containing TV atoms so that a trans- 
lation through Na produce no change at all, i.e. 

{e\0) = {e\Na). (9.63) 

The translation operators form a group of iV elements and as all 
translations commute with one another, the group is an Abelian group. 
As already mentioned previously in the discussion of the axial rotation 
group, each element in an Abelian group forms a class in itself and, the 
number of irreducible representations is equal to the number of elements 
N. All these representations are one-dimensional. 

The operator V 2 is invariant under a translation of the co-ordinate 
system and so the Hamiltonian is also invariant. Consequently the energy 
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eigenfunctions must transform according to one of the irreducible 
representations of the group. An alternative view point is to note that the 
Hamiltonian and translation operators commute with each other so wave- 
functions may be chosen to be simultaneously eigenfunctions of Jf and 
all the operators {e | na }. 

If \j/ is the basis for one of the irreducible representations 

{e\a}iji(x) = \ilf(x) and {e\na}^(x) = \"^{x) 
where X is a complex constant. The periodic boundary conditions require 

X* = l 
and so X must be one of the Mh roots of unity 

x = e -i2«r/N r =i > 2,...,N. 

If a real wave number k is defined by 

1 2tt 
k = — — r 
N a 

then the irreducible representations and hence the energy eigenfunctions 
may be classified by the wave number and 

{e| naW k (x) = «-'*"■ **(x). (9.64) 

This is Bloch's theorem and a wave-function satisfying this condition is 

t k (x) = <J kx Uk (x) (9.65) 

where u k (x) has the lattice periodicity. 

9.8 Group Theory and Perturbation 

As has been seen in Chapter 7, perturbation of a physical system often 
results in the removal of degeneracies. This is of course to be expected 
as the perturbed Hamiltonian has in general a lower symmetry than the 
unperturbed Hamiltonian. The previously degenerate wave-functions 
now form a basis for a reducible representation for the new smaller 
symmetry group and will split into two or more irreducible represen- 
tations. A degenerate level will divide into two or more levels of 
diminished degeneracy. 

In the discussion of the first -order Zeeman effect in Chapter 7 it was 
shown how the (2/ + 1) degenerate eigenfunctions of the hydrogen atom 
with given n and / quantum numbers split into (2/ + 1) separate levels 
when an axially symmetric magnetic field is applied. 

The crystal field as a perturbation 

Important examples of the removal of degeneracies by perturbation occur 
in the splitting of the energy levels of paramagnetic ions of transition and 
rare earth metals in a crystal. The metallic ion and its surrounding neigh- 
bours (water molecules etc), should be solved as a many-body problem. 



230 A FIRST COURSE IN QUANTUM MECHANICS 

However, it is often a good approximation to consider the neighbours as 
producing an electrostatic crystal field which perturbs the degenerate 
(d or/) eigenfunctions of the free metallic ion. Paramagnetic resonance 
is a tool that has proved useful in experimental measurements. 
The Hamiltonian of an ion in the crystal is written 

where Jf° is the spherically symmetrical free ion Hamiltonian and 4 C 
is the electrostatic potential energy of the ion in the crystal field. In the 
intermediate crystal field case, 4 C is greater than the spin -orbit coupling 
terms (spin is introduced in the next chapter) and the main effect is to 
split the central field orbital degeneracy. The full analysis involves the 
many-electron wave-function and will not be attempted here. A simplified 
discussion is given below. 

Ions of the iron transition group with partially-filled 3 d -she lis often 
belong to the intermediate field case. Often they occur in surroundings 
that are almost cubic, i.e. A c has cubic symmetry. 

The five degenerate c?-wave-functions 

2 (x ± iy)zf(f), (x ± iyffir), y/l (3z 2 - r 2 )f(r) 

form a basis for the irreducible representation D^ of the full rotation 
group. This 5-fold degenerate level splits up into a 2-fold and a 3-fold 
level under the perturbing action of the cubic crystal field. To see this it 
is only necessary to consider the 24 proper rotations of the group 432 
that leaves a cube invariant. (The full cubic group m3m contains 48 
operations and can be expressed as the direct product m3m = 432 x /.) 
The 24 elements are divided into 5 classes as below 

(a) The identity. 

(b) The 4 3-fold diagonal axes making 8 elements denoted by 3. 

(c) The 3 rotations through 180° abou,t the cube edges denoted by 2 Z . 

(d) The 6 rotations through 180° about the diagonals in the sides of the 
cube denoted by 2 d . 

(e) The 6 rotations through ±90° about the cubic edge denoted by 4 Z . 



Table 9.4 The character table for this group is given below. 



432 


E 


3 


2z 


2 d 


4 Z 


Ai 


1 


1 


1 


1 


1 


A 2 


1 


1 


1 


-1 


-1 


E 


2 


-1 


2 








Ti 


3 





-1 


-1 


1 


T 2 


3 





-1 


1 


-1 


E + T 2 


5 


-1 


1 


1 


-1 
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The 5d-wave functions transform according to a reducible representation 
of this group and the characters are given in the last row of Table 9.4. It 
is immediately clear from the table that this representation is composed 
of the two irreducible representations E and T 2 and so the 5-fold 
degenerate level splits up into a 2-fold level and a 3-fold level. 

In many cases the almost cubic crystal field may include smaller 
order terms with a further reduced symmetry and the degeneracy may be 
further reduced. The reader is referred tq Heine, p. 148, and Schutte, 
p. 348, for further details. 
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Figure 9.7 Splitting of 5-fold ZJ^-degeneracy by crystalline fields. 



9.9 Selection Rules 

Group theory is very useful in determining which transitions are allowed 
in the interaction of electromagnetic radiation and electrons in atoms 
and molecules. In Chapter 8 it was shown that the transition probabilities 
for a single electron are proportional to 

X*TXidr\ 2 (8.38) 



IJ 



where X; and \f are the initial and final state functions. 
It can be shown that if the function is one of the basis functions for 
an irreducible representation of a symmetry group then the integral over 
all co-ordinates 

J"</>dr = 

if the representation is not the unit representation. The three components 
of r transform like the three p wave-functions, i.e., according to the 
irreducible representation D^ of the full rotation group. This fact 
together with a knowledge of the symmetry properties of x,- and Xf 
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make it possible to decide when the integral (8.38) is zero and hence 
which transitions are forbidden. This topic is dealt with in detail in 
Chapter 1 1 and will not be pursued here. 

PROBLEMS 

1 Prove that the elements of the transformation matrix a (see equation 
(9.3)) satisfy the relations 

3 

2 «,y =1 for all i 

3 

2 a«y a im =0 ; # m. 
i =1 

Prove also that the determinant of the transformation matrix is ± 1 . 

2 Sketch the two functions 

p y = f(r) sin 6 p x = f(r) cos , 

If 4 Z is the substitutional operator representing a rotation of n/2 radians 
in the anti-clockwise direction about the z-axis, show that 

4zPy=Px- 

3 Describe the eight symmetry operations which leave a square 
invariant. The two faces of the square are not identical. Construct a 
group multiplication table. (This is the group 4mm.) 

4 Repeat question three for a rectangle. (In this case the symmetry 
group is composed of four elements and the group symbol is 2mm.) 

5 Show that the four elements 1 , i, — 1 , — i where i = +>/^T form a 
group. Draw up a group multiplication table. 

6 Consider a system which has a single 4-fold rotation axis in the z- 
direction. Identify the four elements and draw up a group multiplication 
table. This is the group 4. Show that this table can be made to have the 
same form as that of question 5 by pairing off each element of the 
group 4 with a suitable element of the group of question 5. The two 
groups are said to be isomorphic. 

7 A real operator A is related to its Hermitian adjoint A* by 

j(A4>*)\p dr = j<p*A + ip At 

where 0, \p are arbitrary functions and the integration is over all space. 
If A is a linear operator which possesses an inverse v4 -1 it is said to be a 
unitary operator if A + = A -1 . 
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The integral over all space / 0*i// dr is invariant under any reflection 
or rotation of axes. Explain how this implies that a symmetry operator 
is unitary. 

8 Table 9.3 gives the group character table for the equilateral triangle 
group 3m. 

The character of the group element A is the a-irreducible representation 
is denoted by x^fA). Verify the 'orthogonality relations', 

Ix (ar (A) X (l3 \A) = a*l3 

A 

2x {a) \A)x (a \A)=N. 

A 

The summation is over all the N elements in the group. 

9 Suppose x(A) is the character of the group operator A in the 
reducible representation T and x^(A) is the character of A in the 
irreducible representation rO. Then 

X(A) = Ia i x ( - i \A) for all A 

i 

where a,- is the number of times T^ occurs in I\ 

By multiplying both sides by x^*(A) and summing over all members 
of the group show that 

a / = ^2x (/) V)xW 0) 

n A 

where N is the number of elements in the group. (Hint: Use the 
orthogonality relations given in question 8.) 

Consider the reducible representations of the group 3m given by the 
first row of Table 9.2. Use equation (1) to show that this representation 
is composed of irreducible representations A, E. 

10 Write down the 24 symmetry operations which leave a regular 
tetrahedron invariant. This is the group 43m. Divide the group elements 
into the five classes and give the dimensions of all the irreducible 
representations. 

11 In a free atom the atomic orbitals (2s), (2p x ), (2p y ) and (2p z ) are 
degenerate. In a tetrahedral molecule, such as methane the molecular 
field perturbs the free-atom carbon wave-functions and destroys this 
degeneracy. Choose one element from each of the Ave classes of the 
tetrahedral group 43m and construct the matrices of the reducible 
representations with basis (2s), (2p x ), (2p y ) and (2p z ). Show that this 
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representation is composed of a single one-dimensional and one three- 
dimensional irreducible representations. (Hint: Choose the co-ordinate 
axes so that a three-fold axis lies along the (111) direction.) 

12 Suppose a uniform electric field is applied along one of the three- 
fold axes (C— H bond) of the methane molecule. The reduced symmetry 
group at the carbon atom is the group 3m composed of six elements and 
the group character table is given below. 



Number of elements in class 




2 


3 


Typical element in class 


E 


A 


B 


A 2 
A 3 


1 
1 

2 


1 

1 

-1 


1 

-1 




Identify the elements A and B. 

Show that the three-dimensional irreducible representation of the 
group 43 m with basis (2p x ), (2p y ) and (2p z ) is composed of two 
irreducible representations of the reduced symmetry group 3 m. 

This result shows that the three-fold degenerate level splits up into a 
non-degenerate and doubly-degenerate level when the electric field is 
applied. 

1 3 Let a be an axis in the yz plane through the origin at an angle of 
to the z-axis. A physical rotation through <f> about the a-axis is equivalent 
to a physical rotation 8 about the x-axis so that the a-axis coincides with 
the z-axis, followed by a physical rotation through <j> about the z-axis 
followed finally by a physical rotation through — f? about the x-axis. 

i>(-0, a) = P($,xyP(-4>, z)P(-0, x) 

where P(9, x) denotes a passive rotation of the co-ordinate system 
through 6 about the x-axis. Use (9.45) to express 1^ in terms of ty 
and iy^ and with the help of (9.46a) show that 



■/<'>#>=«$'> 



by equating coefficients of the product Q<j>. 
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CHAPTER 10 



Electron Spin 



10.1 Introduction 

The quantum mechanical theory developed in the previous chapters is 
not sufficient to explain all the experimental results of atomic physics. 
It has already been shown that the magnitude of the angular moment um 
of a particle in a central field problem can take the values fi V/(/ + 1) 
where / is an integer. This follows from the differential form for the 
angular momentum operator together with the requirement that the 
wave- function be single-valued. However, half integral values of the 
angular momentum quantum numbers are necessary to explain the fine 
structure in the spectral series of some elements. The sodium D-line 
doublet is a well-known example. 

Also the anomolous Zeeman effect cannot be explained by integral 
values of / alone. Schrodinger theory indicates that the lowest state for 
the sodium atom is non-degenerate with zero angular momentum but 
the Zeeman effect in sodium can only be explained by assuming the 
ground state splits into two levels when a weak magnetic field is applied. 

Apart from this spectroscopic evidence, Stern and Gerlach in 1921 
demonstrated experimentally the existence of the magnetic moment of 
an electron. They did this by passing a narrow beam of silver atoms 
through an inhomogeneous magnetic field and observing that this beam 
split into two beams. (Note that a beam of electrons is not used since 
the Stern-Gerlach separation is then of the order of that imposed by the 
uncertainty principle.) 

10.2 The Electron Spin 

Uhlenbeck and Goudsmit in 1925 suggested that the electron has an 
intrinsic angular momentum and in 1927 Pauli introduced electron spin 
into quantum mechanics. 
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The experimental results can be understood if it is assumed that an 
ele ctron ha s an intrinsic spin angular momentum of magnitude 
h y/s(s + 1) where s = 5 . Further, the component of the angular 
momentum can only be known along any one prescribed direction, 
say the z-direction and must take one of the values m^i with 

m, = ±\. (10.1) 

Also, it must be assumed that associated with the spin angular 
momentum the electron has a magnetic moment equal to —efi/m times 
the spin momentum where m is the electron mass. 

These assumptions arise naturally in the Dirac relativistic theory but 
must be postulated in the non-relativistic theory now under discussion. 

In the Schrodinger theory, the three position co-ordinates of an 
electron form a complete set and when all three are known the electron 
state is specified completely. Now for a complete specification of a state 
the projection of the spin momentum along a prescribed direction must 
be known also. 

Postulate 7. Spin 

It is now postulated that a particle, in addition to its spatial co-ordinates, 
has an internal degree of freedom characterized by a spin co-ordinate a z . 
The particle state function is then a function of four co-ordinates 
\jj (x, y, z, a z ). The spin co-ordinate can only take a finite number of 
discrete values. For electrons, positrons, neutrons and protons, a z can 
take either of the two values ±1 . (This corresponds to the z-component 
of the spin momentum taking the values ±ft/2.) The rest of this chapter 
is concerned only with such particles with spin 1/2. 

The quantum mechanics based upon the seven postulates in this book 
is valid so long as the energies included are small compared with the 
energy associated with the particle rest mass (~0-5 MeV for electrons). 
At high energies a relativistic theory should be applied. 

The operators representing the components of orbital angular 
momentum have been obtained from the classical expressions using 
Schrodinger's substitution for momentum. Because the spin co-ordinate 
can take only two values it follows (see below) that an electron spin 
momentum components can only take the values ±h/2 and hence in the 
classical limit h -*■ 0, spin vanishes. Spin does not occur in classical 
mechanics, and an alternative method must be used to define spin 
momentum. 

In Chapter 9 it has been shown how p, the momentum operator conjugate 
to q, is related to the infinitesimal translation (rotation) operator I q by 

p=hl q . (10.2) 

The infinitesimal rotation operators discussed in section 9.7 transform 
the three space co-ordinates and satisfy the three commutation rules of 
the form (9.44). These rules are a basic property of rotation operators. 
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The spin co-ordinate will be transformed under a rotation of axes and the 
infinitesimal rotation operators ljf\ 1^ and 1^ which carry out this 
transformation must satisfy the basic commutation rules of the type 
(9.44). 

The fundamental result (10.2) can now be extended to define three 
spin momentum operators 

s x =hl x s \ s y =hl y s \ s z =hl¥\ (10.3) 

The spin infinitesimal rotation operators (unlike the space co-ordinate 
infinitesimal rotation operators) do not have a differential operator form 
and so neither do the spin momentum operators. However, the spin 
momentum operators must satisfy the same commutation relations as 
<e x ,& y and i? z .Thatis 

X V V X — *'*"Z 

V z z v — ***"JC 

s z s x -s x s z =ihs y . (10.4) 

As the spin component operators do not commute with each other only 
one of them, say s z , can be known with certainty at any instant. The 
total spin momentum operator defined by 

s 2 op =sl+s 2 y+sl (10.5) 

commutes with all three components and so the total spin momentum 
can be measured simultaneously with any one of the three components. 



10.3 Spin Wave-Functions and Representations 

The wave-function for a single electron is now to be regarded as a 
function of four variables. The three space variables are continuous but 
the spin variable can only take discrete values ±1 . So 

\l/(x, y, z, a z ) = ^+(x, y, z)<*(?z) + ip-(x, y, z)f$(o z ) (10.6) 

where a(a z ) and 0(o z ) are functions such that 
a(l) = l and a(-l) = 
0(1) = and 0(-l) = l. (10.7) 

\p+(x, y, z) is the 'component' of the wave-function with spin co-ordinate 
a z = 1 and ^_(x, y, z) is the 'component' with spin co-ordinate a z = — 1. 
If the electron is in a state defined by a z = 1 say, then 



\p(x.y,z, a z ) = \l/+(x,y,z) 
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In the general case / |i//+| 2 dr is the probability of observing the state 
a z = 1 . The values of a z correspond to the eigenvalues of s z . This is 
explained below. 

A rotation of axes corresponds to a linear transformation of a(a z ) and 
j3(a z ). From (10.6) all functions of a z lie in the two-dimensional spin- 
space spanned by a(o z ) and /3(a z ) and so the space is invariant under 
the rotation group. For particles with spin \ this space must be 
irreducible under the group and a(a z ), $(p z ) form a basis for the 
representation Z>< 1/2 \ Conventionally a(a z ) and j8(a z ) are taken to be 
the eigenfunctions of the rotation operator ffl with eigenvalues 
m s = \, —\ respectively (9.48), i.e. 

Iz >) <*(p z ) = {a{o z ) with m s = h 



or 



and 



or 



s z a(CT z ) = -a(a 3 ) 



ti s) P(°z) = -i P(o.) with m, = -i 



s,P(Pz) = -\ft°,y (10-8) 

The spin operator sip —s* + s$ + s| has eigenvalues i(i + l)ft 2 = \h 2 in 
both states (see (9.59)). The results agree with the requirements 
mentioned in section 10.2. 

If /*(0, z) is the operator that represents a rotation of d> about the 
z-axis and acts only on the spin co-ordinates, then from (9.46) 

Fib z)a(Oz) = exp(ty/2)a(o z ) (10.9) 

and similarly for j3(o z ) 

.-. I*(27r, z)a(a z ) = -a{o z ) (1 . 1 0) 

and 

P s (47r,z)a(a z )=+a(a z ). 

Rotations of 4n must be considered before oc(a z ) and (3(ff z ) are trans- 
formed identically into themselves. These functions are called spinors. 

It has been stated in postulate 1 that all wave-functions must be 
single-valued. This is not strictly true. The physical requirement is that 
integrals of the form 



jipfAil/jdr 
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be single-valued. If \p t and 1/7 both change sign under a rotation of 2u 
applied to all co-ordinates then clearly the integral is invariant. However 
arguments have been given (Blatt and Weisskopf 1952, H. E. Rorschach 
1962), to show that wave-functions depending on the three space co- 
ordinates x, y, z must be single valued as already stated. This argument 
does not apply to wave-functions depending on spin. 



Matrix representation 

The spin momentum operators do not possess a differential operator 
representation. However, a set of representative matrices can be con- 
structed. If the spin eigenfunctions are regarded as basis functions then 
an arbitrary wave-function <//(*, y, z, o z ) is represented by the column 
vector (see (10.6)). 

\jj+(x,y,z) 

(10.11) 
.ty-(x,y, z) . 

Normalization of the wave function \p(x, y, z, a z ) requires 

2 \\Mx,y, 2, o z )\ 2 dr = 2 f [l* + l 2 a 2 (a z ) + l\U 2 2 (a z ) 
a z J o z J 

+ (*:*_ +iK*-Wa z )j3(o z )] dr= 1 

where the summation is over the two discrete values a z = ± 1 . 
From (10.7) 

2«(a z )0(a z ) = O 



Oz 



and 



so 



2« 2 (a z ) = 20 2 K)=l 
"z o z 

j[M + (x,y,z)\ 2 +M_(x,y,z)\ 2 ]dT> 



1. (10.12) 

The probability of the z-component of electron spin being measured 
as +h/2 is 

j\* + (x,y,z)\ 2 dT 
and similarly, the probability of spin — till is 

j\4,_(x,y,z){ 2 dT. 
The column vectors representing ot(o z ) and P(a z ) are respectively 



= 



1 




and 



P = 




I 



(10.13) 
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The matrices representing the spin momentum operators s x , s y and s z 
must be 2 x 2 and the reader can easily verify that the choice 



h 
s x = — 

x 2 



/o l\ = h(o -A = hl\ o\ 

\1 0/' Sy ~2\i o]' Sz 2^0 -1/ 



(10.14) 
satisfies the commutation relations (1 0.4). These can be written 
h 

— I 

2 



Sx=^°x, Sy=tLOy, S z =-O z (10.15) 



with 



"(? i)- 



/ o' CTz= -1 



a y=\, J> CT z=L J 00.16) 



and a x , ay and o z are known as the Pauli spin matrices. All these matrices 
are Hermitian and have real eigenvalues. The correctness of the choice 
(10.14) is confirmed by noting that the eigenvalue equations 



s z o = -a and s z |3=--0 










are represented by 










!(i-9(iH(D - 




till 
2 \0 


-) 


(10.17) 


respectively. 

Note also that from (10.13) and (10.1 


*) 








Sx* = 2& SxP=-<*, SyO.= 


ih 

2 


P, 


s^= 


m 
-T a - 



(10.18) 



10.4 Total Angular Momentum 



It is necessary to know how the one-electron operators which are 
functions of the space co-ordinates and momenta (and not of spin) are 
represented in the new two component theory. Clearly such operators 
must be represented by 2 x 2 matrices. The (1,1) element of the matrix 
representing a Schrodinger operator y which is spin independent, is 

2a(ff z )<ya(ff z ) = y 2<* 2 (a z ) = y. 
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Similarly the (2, 2) element is 

2P(o z )7p(o z )=y'Zp 2 (o z ) = y. 
°z °z 

However the off -diagonal elements are both zero as 
Ia(o z )yP(o z ) =y 2>(o z )0(a z ) = 0. 

°z "z 

So, all spin independent operators, such as y, are represented by a 
diagonal matrix 



y 
y 



= 7 I. 



All such diagonal matrices commute with the Pauli spin matrice. 



Total angular momentum 

The total angular momentum of an electron is defined to be the sum of 
the orbital angular momentum and the spin angular momentum. The 
2x2 matrix operator representing the z-component is 



j z = se z \ + s z 



*JL + A 

i 90 2 







i 90 



(10.19) 



J z has half -integer quantum numbers as J§? z and s z have integer and 
half-integer quantum numbers respectively. It is instructive to demon- 
strate this using the two component theory. The eigenvalues of J z are 
denoted by mjh and in two component form, the eigenvalue equation 
is (in spherical polars) 



fijWl 0\ ft/1 
i 90 \0 1/ + 2 \0 -1 

This represents two equations, 



(::)-*(::)■ 



— +- * + = m,jl> + 



1 

I 


9 i " 

90 2 


1 

i 


9 i 
90 ~ 2 



The first equation may be written 

— = ? K- 5 )^ 
:. !// + (r, 6, 4>)=A + {r, 6) exp [i(rn, - i)«]. 
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As \p+ must be invariant under a rotation of 2ir about the z-axis 

m.j — \= integer 

rttj = integer + \. 
Similarly 

tf/_(r, 9, <t>)=A_(r, 0) exp [i(m f +{)4>] 
and 

rrij = integer —\. 

Therefore the eigenvalues of J z are half-integral multiples of h. 
The matrix operators representing the x and y components of the 
total angular momentum are 

The matrix representing the squared total angular momentum is 

= SC 2 l + s 2 + 2 [ S£ x % x + &ys y + £C z s z ] (10.20) 

as the matrices representing the orbital and spin components commute. 

The matrices J*, i y and J z satisfy the same commutation relations 
as the orbital and spin angular momentum operators and a straight 
forward manipulation will show that each component commutes with J 2 . 
"So, functions can be found which are simultaneously eigenfunctions of 
J z and J 2 . 

Rotation operators 

The total angular momentum can be associated with infinitesimal 
operators representing a simultaneous rotation of both space and spin 
co-ordinates. From the definition, the z-component of the total angular 
momentum is equivalent to the sum of infinitesimal rotation operators 

and similarly for the x and y components. 

Consider now an operator which produces a finite rotation of both 
space and spin co-ordinates through an angle 9 about the z-axis. This 
operator can be expressed as a product of two operators one of which 
acts on the space co-ordinates and the other on the spin co-ordinate. If 
I^P is the operator representing an infinitesimal rotation of all four 
co-ordinates, then from (9.46b) 

exp (idlP) = exp (jBJ<p) exp (/6>/ z *>). 
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This is valid for all 6 and so by equating the coefficients of 6 

#> = /J r) +/,<*>. 

In conclusion, the z-component of the total angular momentum is 
equivalent to the 'total' infinitesimal rotation operator, 1^ multiplied 
by h, i.e. 

and similarly for the x and y components. These three new 'infinitesimal' 
rotation operators I^\ 1^ and IY^ clearly obey the commutation 
relations of the type (10.4). 



Eigenvalues 

The argument given in section 9.7, which depends only on the commu- 
tation relations, now indicates that J 2 has a set of (2/ +1) eigenfunctions, 
with 2/ integer, with common eigenvalue /(/ + \)h 2 . Each of these 
eigenfunctions may be chosen to be simultaneously an eigenfunction of 
J z with separate eigenvalues 

mjtL=jh,{i-\)h,...,-iti. (10.21) 

For a single electron, ntj and hence; is half-integral. 

Addition of angular momenta 

Consider a one-electron system whose orbital angular momentum and 
spin momentum are conserved. It is shown below that for a given / the/ 
quantum number can take only two values, namely / ± \ . 

The four matrix operators J 2 , J z , Sf 2 \ and s 2 all commute with one 
another and the corresponding variables are simultaneously measurable. 
Alternatively, the four matrices JSf 2 I, £C z l, s 2 , s z also form a mutually 
commuting set and the observables are compatible. 

Suppose these two sets of four operators are taken to be complete. 
Any other operators needed to make the sets complete will be assumed 
to have fixed eigenvalues. Normally the Hamiltonian is one of these and 
it must be spherically symmetrical and not include any spin-orbit 
coupling term. For a single electron s 2 can only have the eigenvalue 
3h 2 /4 and any state can be specified by the three quantum numbers 

(i) /, ntj, I or by (ii) /, m ; , m s 

respectively. 

In order to find the different possible eigenvalues of J 2 associated 
with the first set of operators it is useful to consider the labelling of the 
different states arising from the second set. 
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Consider the second set. For a given /, m t can take (2/ + 1 ) values and 
m s can take two values and so there are 2(2/ + 1) different degenerate 
states altogether. Linear combinations of these states can be constructed 
that are eigenfunctions of the first set of operators. Consequently, for a 
given / there must be 2(2/ + 1) different states. 

From (10.19) m,- = m l +m s and the third column of table 10. 1 shows 
how the allowed values for m/ in the second set are obtained while the 
fourth and fifth columns show how they arise in the first set. 



Table 10.1 



Second set 


First set 




m s 


m i 


/' = / + * 


/ = /-* 


rrt[ 


m ; - 


m 7 


I 


+ k 


+ \ 


/ + i 




i 

_ 5 


1 

-5 




/-i 


I- 1 


1 

-5 


1 

- 5 

r 3 

- 5 


/-4 


/-« 


/- 2 


+ i 


i s 


/-! 


/-! 




-5 


' - 5 




-/ 


+'i 


-/ + i 




-i + i 


l 

-5 


-/-i 


-/-i 





It can be seen that all 2(2/ + 1) states for a given /, that occur in the 
first set can be accounted for by letting; take the values 



= / + • 



Z>0. 



(10.22) 



In the language of the 'vector model', the electron spin angular momentum 
is either parallel or antiparallel to the orbital angular momentum. (Note, 
when 1 = 0,]' must be equal to £ ). 



10.5 Spin Magnetic Momentum 



It is well known that a classical electron moving in a circular orbit with 
angular momentum L has a magnetic moment — eL/2m where m is the 
electron mass. Perhaps not surprisingly there is also a spin magnetic 
moment associated with the electron spin momentum. Using relativistic 
arguments (F. Rohrlich, 1965), it can be shown that the relationship 
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between the spin angular momentum of a point electron and its spin 
magnetic moment ja is 

f* = -g es/2m 

where g = 2 to a first approximation. 

(This value of g is slightly in error due to the neglect of radiation 
reactions.) Observe that the ratio of spin magnetic moment to spin 
momentum is twice as large as the corresponding orbital ratio. 



Spin spectroscopy 

Suppose an electron is in a uniform static field of induction B. This field 
interacts with the electron moment to give a contribution 

— (X. B (10.23) 

to the energy. If the spin-independent terms are omitted the Hamiltonian 
can be written 

— [B x s x +B y s y + B z s z ] (10.24) 

m 

where B x ,B y andB z are the components of the field. 

If the field is parallel to the z-axis, the time-independent Schrodinger 
equation becomes 



eh 

2m 



This two component equation has two eigensolutions. The upper 
level has energy 

t - etl o 

2m 
with eigenvector a and the lower level has energy 

E-=-—B z 
2m 

with eigenvector P. So the effect of an applied static magnetic field B is 
to split an electron energy level up into two levels with energies 
±ehB z j2m corresponding to the two spin directions. This is essentially 
the explanation of the anomalous Zeeman effect in atoms with one 
valence electron. 

One method of observing these levels is by electron spin resonance. 
The system in the static field B is subjected to an alternating electro- 
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magnetic field of angular frequency to. When the frequency satisfies 

hco = ehB z /m 
i.e. 

co = eBjm 

transitions are induced and resonance occurs with energy absorption 
from the alternating field. Electron resonance experiments are generally 
carried out at fixed frequencies and the field B z varied. Such experiments 
can give useful information about the magnetic moments of more com- 
plicated systems and hence about their structures. 

Atomic nuclei also have spin and associated magnetic moments. The 
splitting of the nuclear levels in a magnetic field can also be observed by 
resonance techniques. The resonance frequency for electrons is in the 
micro-wave region whereas the heavier mass of the nuclei result in a 
smaller (hyperfine) energy splitting and the corresponding resonance 
frequencies are in the radio frequencies (see problem 7). 

Fine structure 

The energy expression (4.71) does not account completely for the 
observed spectrum of hydrogen. Many of the spectrum lines are com- 
posed of several components illustrating a fine structure. In 1916, 
Sommerfeld modified the Bohr atom theory by taking account of the 
relativistic mass change of the electron in an elliptic orbit. He found that 
this introduced a small energy correction, so that levels with the same 
principal quantum number but a different azimuthal quantum number 
have slightly different energies. The corresponding fine structure gave 
good agreement with the observed splitting for hydrogen. 

Even before the development of quantum mechanics it was evident 
that this theory did not explain the spectra of the alkali atoms. An analy- 
sis of the penetration of the single valence electron into the ion core 
shows that the energy levels depend very much on the azimuthal quantum 
number and that Sommerfeld's relativistic mass effect is small in 
comparison. 

The spectrum lines corresponding to a transition from the level (n, /) 
to the level («', l 1 ) often show a fine structure which is of the order given 
by Sommerfeld's correction although his argument cannot apply as any 
splitting of the upper (lower) levels must produce two or more levels 
with the same azimuthal quantum number. The doublet splitting of the 
alkali atoms can be explained on the basis of electron spin and the 
theory is outlined below. The fine structure of hydrogen is really a com- 
bination of spin and relativistic mass change effects. Of course spin itself 
follows from a relativistic formulation of quantum mechanics. 

The energy levels associated with the excitation of the single valence 
electron of an alkali atom can be evaluated using a single-electron model 
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in which the combined effect of the core electrons is represented by a 
non-coloumbic central field V(r). The accidental degeneracy of the levels 
with the same principal quantum number but different / is then removed. 
Ignoring spin, the Hamiltonian 

H 2 
tf ° = - — V 2 + V(r) 
2m 



commutes with the orbital angular momentum operators and the energy 
levels have degeneracy (2/ +1). Each eigenfunction may be specified by 
the three quantum numbers n, I. m/. On the incorrect assumption that 
the Hamiltonian is unaffected by spin, each of the levels mentioned 
below is now composed of two levels corresponding to m s = ±\. The 
degeneracy is doubled to 2(2/ + 1). 

However, the Hamiltonian is really changed on the introduction of 
spin and the 2(2/ + 1) levels are not in fact all degenerate. This effect is 
called 'spin-orbit' coupling. 

In a frame of reference that is stationary relative to a nucleus an 
observer will only experience the electric field S. However, in a frame of 
reference moving with an electron orbiting the nucleus, an observer 
would experience a magnetic induction, 

B = -VA«f/c 2 (10.26) 

where V is the orbiting electron velocity and c is the speed of light. 
This is a relativistic effect (Ponofsky and Phillips, p. 284). 

This field interacts with the spin magnetic moment to give a 
contribution 

-ij*.B (10.27) 

to the total energy. The factor one-half is the Thomas factor (1926) and 
is due to the fact that the Hamiltonian is expressed in a field of 
reference in which the nucleus is at rest. 
The electric field may be written 

-i-^i (10.28) 

e dr r 

(take the nucleus at the origin) where V{r) is the central field potential 
energy. 

By combining (10.26), (10.27) and (10.28), it can be seen that the spin 
orbit interaction term to be added to the central field Hamiltonian ^f° is 

1 dV 
A s — { se x *x + &ys y + Se z s z ). (10.29) 

2c mr &r 

This perturbation is a small effect but does give rise to fine structure of 
spectrum lines. 
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The 2x2 diagonal matrix Jf °I representing the original central field 
Hamiltonian commutes with the matrices representing the orbital 
angular momentum operators and the spin operators, and the five 
matrix operators 

jf°i, sen, & z \, s 2 , Sz 

form a complete set of commuting operators. For a single electron, s 2 
always has the eigenvalues 3h 2 /4 and so a state is specified completely 
by the four quantum numbers n, I, mi and m s . Alternatively the 
commuting set of matrix operators 

jt°i, j 2 , j z , sen, s 2 

is complete and the quantum numbers n, j, m,- and / completely define 
a state. 

The new Hamiltonian, including the spin-orbit coupling 

no longer commutes with the orbital momentum operator J£? 2 I and the 
spin operator Sz and so m/ and m s are no longer good quantum numbers. 
From a symmetry viewpoint, the Hamiltonian is no longer invariant 
under a rotation of space co-ordinates or of spin co-ordinates alone. It is 
however invariant under a simultaneous rotation of both the space and 
spin co-ordinates. 

The new Hamiltonian forms a commuting set with the operators 



&H, 



2 



However, whereas the unperturbed energies depend only upon the 
quantum numbers n, I the perturbed energies depend also upon/. Under 
the action of the spin-orbit coupling (10.29), the 2(2/ + 1) degenerate 
levels with a given n, I split into a number of distinct levels which differ 
only in their values of/. 

The spin-orbit coupling term can be written (10.29) 

1 AV 

. 2 2 T 1 ^ 2 - ^ 2 I " s 2 }- 00.30) 

4c 2 m 2 r 6r ' 

This is strictly a problem involving perturbation of a degenerate level, 
but if the unperturbed wave-functions are specified by n, f, rrij, I the 
off-diagonal terms of the perturbation determinant (Chapter 7) are 
already zero and to a first order, the change in energy of a level is the 
expectation value of (10.30), i.e. 

^Cf)./^ "'^ "^ (10,31) 
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For a given n and a given / ¥= 0, the level splits into a doublet correspond- 
ing to /=/ ± ^ . The energy separation between the levels is 

h 2 A dv\ 

5£ = -^2(2/ + l)(- — ) . (10.32) 

Am c \r ar/ n i 

There is no splitting of an atomic level with / = as in this case / must 
equal \ . 

The sodium D-lines may be explained by this theory and (10.32) 
gives the correct order of magnitude although there is difficulty in 
specifying V(r) exactly. 



Lande g-factor 

An important quantity in the analysis of atomic spectra is the ratio of 
the magnetic moment of an atom to its angular momentum. This is the 
gyromagnetic ratio and it is usually written as 

g-^- (10.33) 

2m 

where \g\ is the Lande g-factor. The g-factor for spin is equal to 2. 

In Chapter 7, in the discussion of the weak-field Zeeman effect for a 
central-field (spin-less) electron in a uniform external magnetic field B 
in the z-direction, it is shown that the extra terms to be added to the 
Hamiltonian due to B is 

eB h 3 e , 

— s BSe z (7.77) 

2m i 90 2m z 

where m is the (reduced) electron mass. From (7.77) it is clear that the 
g-factor for orbital momentum is unity. 

When spin is included the field interaction term to be added to the 
Hamiltonian is 

4=^-(^ 2 I+2s z ) = ^-(J z +s z ). (10.34) 

2m 2m 

If the field is weak enough this term is small compared with the spin- 
orbit coupling and may be regarded as a perturbation to the Hamiltonian 
including spin-orbit coupling. The unperturbed eigenfunctions are speci- 
fied by the quantum numbers n, j, m/, I. The perturbed Hamiltonian does 
not commute with J 2 and so /' is strictly no longer a good quantum 
number and off -diagonal terms do occur in the perturbation determinant. 
However it can be shown (Bethe, p. 1 14) that the non-zero terms occur 
between states of different / and not between states of the same / and 
different m/. The off-diagonal terms can be neglected because of the 
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relatively large energy separation between states of different / due to 
spin-orbit coupling. 

The perturbation produces a first-order shift in energy of the state 
(«, /, ntj, I) equal to the expectation value of (10.34). It can be shown 
that (p. 61-64 in Condon and Shortley) 



< s, ) = c U z ) 



where 



2/(/ + l) 



(10.35) 



Hence the change in energy of the state (w, /, mj, I) on introduction of 
the magnetic field is 



(1 +c) BmjH. 

2m 



(10.36) 



The (2/ + 1 ) states with different values of my now have different energies 
and the energy change is proportional to the component of total angular 
momentum parallel to the field. 



b = o 



b + o 



i-U-i- 



Spin-orbit 
coupling 



/ = W = i 



/ = 0,7 = i 



lehB/m 



Figure 10.1 The Zeeman effect. 
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From (10.36) the gyromagnetic ratio is 
e 



(1+c) 



2m 



and so the ^-factor for the system is (1 + c), i.e. 

, /(/ + !)-/(/ + !) +5(5 + 1) 

* =1 + W+l) 

For a single electron s = \ and for 

/ = /+*, *=! + 
and for 



2/+1 

1 
2f + l 



(10.37) 



For an s-state / = and g = 2 and the two levels split by 

en — 
m 

The anomalous Zeeman effect of the alkalis is explained in this way. 





VX/+1), 



The effective average magnetic 
moment is the component M 
along the direction of 
total angular momentum. 

\i = /liCOSfly + JI.COsfl,, 



Figure 10.2 The vector model for the Lande-factoi. 
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In a strong magnetic field the interaction term (10.34) may be large 
compared with the spin-orbit coupling. Then the spin-orbit interaction 
is regarded as the perturbation. This is the Paschen-Back effect. 



PROBLEMS 

1 Verify that the matrix representation (10.14) satisfies the commu- 
tation relations (10.4). 

Show that the matrices representing s x and s y have eigenvalues ±h/2. 

Show further that the matrix representing s 2 is 



hi ( 3 o\ 

4\0 3J 



and deduces 2 has only one eigenvalue 3h 2 /4. 

2 Define operators a, (3, y by the relations 

a = 2s x /ih, $ = 2s y /ih y= 2s z /ih 
Using (10.14) show that 
a 2 =p 2 =Y 2 =-I 
and further that 

Py=-yp=<* 

ya=-ay=(i. 

These results show the relation between the spin operators and 
quaternions. 

3 Consider the Pauli spin matrices (1 0. 1 6). Verify that 

a x -a y -o z - +1 
and obtain the three commutation relations of the form 

o x o y — o y a x = 2ia z . 
From these commutation relations show that 

O x Oy + OyO x — 

and deduce 



Compare these results with the similar results obtained for the matrices 
cc,j3, y in question two. 
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4 From equations (10.3) and (10.14) deduce that the 2x2 matrices 
representing the infinitesimal rotation operators I^\ ^ and ^ in the 
irreducible representation Z)( 1/2 > are 



< :) (° :) t _?) 



respectively. Verify that the commutation relations (10.4) are satisfied. 

Use equation (9.46) to show that the corresponding matrix 
representing the finite rotation P s (<p, z) is 

y*/ 2 o \ 

e-'*/ 2 /' 

5 In the isotopic spin theory protons and neutrons are regarded as two 
distinct states of the 'nucleon'. The two states are defined by the two 
possible values of the isotopic spin co-ordinate t z .t z = + 1 and r z = — 1 
correspond to the proton and neutron respectively. A nucleon wave- 
function can formally be written 

\l/(x, y, z, t z ) = \p+(x, y, z)a(r z ) + \p_(x, y, z)P(t z ) 

with 

a(+l)=l and a(-l) = 

0(+l) = O and |3(-1) = 1. 

In analogy to the electron spin matrix representation, confirm that 
the matrix representation 



-«i) - : 



-(i -:) 



is valid. 

The electron charge of a nucleon is represented by the matrix 

Confirm that the proton and neutron state vectors are eigenvectors of q 
with eigenvalues +e and respectively. (For further information the 
reader is referred to Heine's book.) 

6 When a magnetic field of induction B is applied to an electron and if 
all other contributions to the energy are neglected, the Hamiltonian is 

— [S X B X + SyBy + SztfJ . 
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Show that the eigenvalues of this Hamiltonian are 

E = ±—B where B = (B* +B* +B*) 1 ' 2 . 

2m 

If B is at an angle to the z-axis then 

B Z =B cos 6 and Bl + By = B 2 sin 2 0. 
Show that in this case if the normalized two component wave vector is 

U- 
then 

|iM 2 = cos 2 - and |^_| 2 = sin 2 -. 

7 Suppose, in question 6, the field is along the z-axis, then the 
Hamiltonian is 

eB 

— s z . 
m 

It has been shown in section 10.5 that the eigenvalues of this Hamiltonian 
are E = ± ehB z l2m corresponding to the z-component spin eigenvalues 
±A/2. 

However, if the state function is not an eigenfunction of s z the 
equation that must be satisfied is the Schrodinger time-dependent 
equation. 



2^1° -iJUrM*-)- 



Verify that if initially at t = the state function is an eigenvector of s x 
with eigenvalue +h/2, then at time t, the state function is 



where 

co = eB z /m. 



exp [-/cor/2] 
exp [z'tor/2 ] 



Confirm also, that at t = w/2co the state function is an eigenvector of s y 
with eigenvalue h/2 and at t = 27r/co the state function is once more an 
eigenvector of s* with eigenvalue h/2. 
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These results show how the spin magnetic moment vector processes 
about the applied magnetic field with angular frequency eB z /m. (This is 
identical to the classical frequency.) 

8 The relativistic change in mass of an electron may be taken into 
account by considering the perturbation 

A = -p 4 /8n 3 c 2 

where p is the operator representing linear momentum ju is the reduced 
mass and c is the speed of light. 
Explain why 

2 

4> 



^4 



p>=V 



E + - 



4ire r 



where \p and E are the eigenfunctions and eigenvalues for a hydrogen-type 
atom. Hence show that the first-order perturbation of the energy due to 
relativity, is for the state (n, I, mi) 



£(1) = _ 



1 



M e 8 Z 4 /(47re )Vc 2 . 



(2/ + l)n 3 8n 4 
This term should be added to the spin orbit coupling term in (10.31). 
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CHAPTER 11 



Systems of Particles 



11.1 Introduction 

The basic theory developed through the postulates in the earlier chapters, 
applies to many particle systems. In Chapter 4 it was shown that the 
Hamiltonian operator representing a system of two particles of masses 
m x andm 2 at positions (xi.j'i.Zi) and (x 2 , y 2 , z 2 ) respectively is ( 4 - 45 )> 



2m x 



2 + a„2 + -.2 



bxi dyf bzi 
+ V(x u y u z u x 2 ,y 2 ,z 2 ). 



Im-i 



3x| dy 2 9z 2 



The state function ^ depends upon all six co-ordinates and the time and 
satisfies Schrodinger's time dependent equation (Postulate 3), 

3fV = - . 

i 3r 

The normalization condition takes the form 

JJ*** dx l dyi dzj dx 2 dy 2 dz 2 = JJ^*^ dr t dT 2 = 1 

where the field of integration is over all possible values of the six co- 
ordinates. 

These results are easily extended to a system of n particles. The 
classical Hamiltonian is 



1=1 



tf-2£ +jr fr. '*-.'"> 
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where r,- is the position of the ith. The quantum mechanical Hamiltonian 
operator is 

The state function depends upon the values of all the co-ordinates and 
the time and again satisfies the time dependent Schrodinger equation. 
An important Hamiltonian is that of a free atom with n electrons. 
Ignoring the translational energy and the spin-orbit interaction terms, 
this operator is 

. 2 " " ^ 2 i " " 2 

#E ^V v ?VJL + iV5 £ 

2m *« ' *- 4we r, 2 ^ *- 47re 1 r,- — r 7 - 1 

i' = l i = 1 i = l / =1 

(/ * /) 

(11.2) 

where m is the electron mass and Z is the atomic number. The operator 
V? operates only on the co-ordinates of the r'th electron and | r^ — r ; - 1 is 
the separation between the ith and ;th electrons. 

11.2 Angular Momentum of Systems 

Orbital angular momentum 

Classically, the orbital angular momentum, of a system of n particles, 
about the origin is 

n 

L = 2 r ; A Pl 
i=i 

where p,- is the linear momentum of the z'th particle at position r,. The 
quantum mechanical operator representing the z-component of the 
angular momentum of the system is 

'••i%{«k-»kY%>*' (n ' 3) 



where 

* I 3 9 



is the operator representing the z-component of the ith particle. Similarly 
it is possible to define the operators J% and & y representing the x and 
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v components of the orbital angular momentum of the system and the 
square of the angular momentum of the system is 

It is easy to deduce that the angular momentum operators of the 
system satisfy the same commutation rules as those for a single particle, 
thus 

Se x & y - & y & x = ih S£ z etc. 

(Note that & ix , J5? ; >, i ¥= j, etc. commute as they refer to different 
co-ordinates.) Also JS? 2 commutes with £P X , J& y and & z . 

The angular momentum operators can be related to infinitesimal 
rotation operators. From (1 1 .3) 

where ifp is the infinitesimal operator that operates on the space co- 
ordinate of the ith particle only. By a similar argument to that given in 
section 10.4 concerning rotation operators and total angular momentum, 
it can be shown that the summation on the right hand side of (1 1 .4) can 
be replaced by a single infinitesimal operator producing a simultaneous 
rotation of the space co-ordinates of all the particles. Consequently, 
orbital angular momentum is conserved if the system is invariant under 
a simultaneous rotation of all the space co-ordinates. This is the case for 
the approximate Hamiltonian (1 1 .2) for a free atom. 

The argument given in section 9.6 implies that J? 2 has a discrete set 
of (2L + 1) eigenfunctions, 2L an integer, with common eigenvalues 
L(L + l)h 2 . Each of these functions may be chosen to be an eigen- 
function of ££ z with separate eigenvalues 

M L h=Lfi,(L- l)h -Lh. 

Single-valuedness of the space wave-function once again requires L to be 
an integer. (Note that L, M L are used as the quantum numbers for a 
system of particles and /, mi are used for a single particle.) 

The approximate free atom Hamiltonian (1 1 .2) is invariant under a 
simultaneous rotation of all the co-ordinates and so commutes with the 
system angular momentum operators. The quantum number L is a good 
quantum number and may be used in the classification of states. The 
states with L = 0, 1 , 2, ... are labelled by the letters S, P. D 



Product space 

In a two-particle system where the two particles do not interact, the 
Schrodinger equation is separable and the two-particle wave-function 
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can be expressed as a product of one-particle wave-functions 

"/ , (ri,r2) = ^i(r 1 )i// 2 (r 2 ). 

Products of this type can be used to define a product space. 

The term 'function space' was defined in Chapter 3. Suppose the 
linearly independent set of functions {0,}, / = 1 , . . ., n forms the basis 
for an n-dimensional function space and similarly {$,},/ = 1 , . . ., m 
forms the basis for an m-dimensional space. The nxm linearly indepen- 
dent products 0/0/ are the basis functions for an n x /w-dimensional space 
which is called the product of the original spaces. Any function in this I 
product space can be expressed | 

n m 

i i 
where the n x m coefficients -dy are constants. 

Addition of angular momenta 

Consider a system of two particles and suppose that the operators 
representing the angular momentum components of each particle 
commute with the system Hamiltonian so that the angular momentum of 
each particle is conserved. This implies that the Hamiltonian is invariant 
under a rotation of the co-ordinates of each particle separately. This 
would happen in the case of a two-electron atom where the electron- 
electron interaction term (the last term in (1 1 .2)) is ignored or approxi- 
mated by a smoothed out, separable, central-field potential. 

The operators J5? 2 , & z , <£\, &\ where & 2 , & z refer to the angular 
momentum of the system and £P\, &\ refer to the separate particles, . 
commute with one another and with the Hamiltonian and may be 
measured simultaneously. Alternatively the operators J2ff, S£\ z ,&\, 
&i z where S£\ z , £?iz reIer to the separate particles, commute with one 
another and with the Hamiltonian and so their eigenvalues may be known 
simultaneously. Either of these four sets of four angular momentum 
operators, when combined with certain other operators will form a com- 
plete set. Assuming the other operators have fixed eigenvalues the state 
of the system is specified by either the set of quantum numbers (i)L, M L , 
li,li or by (ii)/i, nt\, h,m2- The quantum numbers L,M L refer to the 
system while /i, m t and l 2 , m 2 refer to the first and second particles 
respectively. 

To find the different possible values of JS? 2 associated with the first 
set of operators it is useful to consider the labelling of the different states 
arising from the second set. 

Consider the second set -2? 2 , jSP lz ,-§?§, £? 2z - For a given /^ the quan- 
tum number mi, can take (21 1 +1) values and similarly for given / 2 , m 2 
can take (2/ 2 + 1) values. So for fixed /j and l 2 there are (2/i + l)(2/ 2 + 1 ] 
different degenerate states that span a sub-space of the function space of 
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the system. The eigenfunctions may be written </> (/i , / 2 , m x , m 2 ) and the 
largest allowed values for m i , m 2 are /j , l 2 respectively. The space of the 
two-particle system can be expressed as the product of two spaces each 
describing a single particle, i.e. 

4>(li,h,m l ,m 2 ) = 4>(l 1 ,m 1 )4>Q2,ni 2 ) 

where 4>Q\, ni\) and 0(/ 2 , m 2 ) describe the first and second particles 
respectively. 

Consider now the first set of operators jSf 2 , J5? z , ££\, S£\. For a fixed 
l\ and l 2 there must be (2li + l)(2/ 2 +1) different states spanning the 
same sub-space of the system space. These eigenfunctions may be 
written i//(L, M L , l lt l 2 ) and are obtained by taking appropriate linear 
combinations of the eigenstates of the second set of operators. The 
coefficients are the Clebsch-Gordon coefficients, i.e. 

ty(L, M L , l u l 2 ) = 2 2 cQi, l 2 , m u m 2 \L, M L )i>Q lt l 2 , m u m 2 ). 
Since 

the largest allowed value for M L is (^ + 1 2 ) obtained by combining 
m x = l x and m 2 = l 2 . It follows that the largest L is also (l t + / 2 ). This 
system wave-function is the product <t>(li,li)4>(l 2 ,l 2 ). 

There are two states in the second set with the next value for M L . 

These occur with m.i=ly,m 2 =l 2 -\ and mj = l\ — 1 , m 2 = l 2 . By 
suitably combining these there must be two states in the first set with 
this value ioiM L . These are 

L = (h + fa), M L =L-1 and L = (/, +/ 2 - 1), M L =L. 
There are three states in the second set with 

and hence three such states in the first set with L = (l t + 1 2 ), Q\+l 2 — l) 
and (/j +1 2 — 2) respectively. For the next value 

M L =(I l +l 2 )-3 

there are four states and so on. 

The number of states with a given M L does not increase indefinitely 
and in fact the maximum number occurs for 

M L =\h -hi 
This implies that the minimum value for L is I /] - l 2 \. 
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Finally, for a given /j , l 2 the system quantum number L can take the 
positive values 

i.e. 

(1 1 +1 2 )>L>\1 1 -1 2 \. (11.5) 

This is the triangle inequality. If/] > l 2 there are (2/ 2 + 1) distinct values 
for L. For each value of L there are (2L + 1) values of M L and the total 
number of states is 

(', + h) 

2 (2L + 1) = (2J,+ l)(2f 2 + l) 

as required. 

The vector coupling rule (1 1 .5) can be used to add three or more 
angular momenta of particles together by successive applications. The 
reader is advised to compare the work above with the discussion of total 
angular momentum in section 10.4. 

There is an equivalent group theoretic way of discussing the addition 
of angular momentum. 



Product representation of a group 

Let the two function spaces {0,-} and {tfy} each be invariant under the 
group of operators {A}. The product space is also invariant under the 
same group of operators and the set of functions {0 ,-<£,•} forms an n x m- 
dimensional representation of the group. In general the product space 
will be reducible into a sum of sub-spaces each irreducible under the 
group. 

As an illustration consider the two sets of functions {^3m t }l = 3, 
m t = 3,2, 1,0, -1, -2,-3 and {iAi m /K = l>"»/= 1,0, -1 which trans- 
form according to the irreducible representations D^ and D^ respect- 
ively of the full rotation group. (Note that i// 3m , =Pf' (cos 0)e""'* etc.) 
The 21 -dimensional product space is invariant under the rotation group 
and transforms according to some reducible representation which is 
denoted by D^ x D^. The 21 functions in the product space can be 
split up into three sets, one containing nine functions forming the basis 
for the irreducible representation D^ another containing seven functions 
and transforming according to D< 3 ) and finally a set of five functions 
transforming according toD( 2 \ The result can be written 



£)( 3 > xlK 1 ) =Z?< 4 ) +£>< 3 > +D<- 2 \ 
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For a more complete discussion the reader is referred to Group Theory 
in Quantum Mechanics, by V. Heine, or Group Theory and Quantum 
Mechanics, by M. Tinkham. 

More generally, if two sets of functions form the basis for irreducible 
representations Z)Wi) and D^i^, the product space transforms according 
to the reducible representation 

Z?0'i> x £)(/*) =£>(/'. + / 2 ) +Diii +/ 3 -i) +. . . +£>I'W*I. (11.6) 
This is of course another statement of the vector coupling rule (1 1 .5). 

Parity 

Consider the inversion operator / which is defined by 

14>{t\,r 2 , . . .,r„) = i//(-r 1( -r 2 , . . ., -r„). 

That is, it changes the sign of each of the position variables of all the 
particles in the system. The eigenvalue equation for the inversion 
operator is 

fy = pty 

where p is an eigenvalue and ^ is an eigenfunction. If the inversion 
operator is applied twice the function must be converted back into 
itself. The operator I 1 is equivalent to the identity E and 

I 2 \P=p 2 ^j = 4/ 

p 2 = I and p- ±1. 

The inversion operator has two eigenvalues/? = ±1 and the eigen- 
functions are either 

(a) unchanged in sign when acted upon by the inversion operator (even 
functions) or 

(b) changed in sign when acted upon by / (odd functions). 

The state is said to have even or odd parity respectively. 

For some systems the parity eigenvalues are good quantum numbers. 
Consider a closed system of particles with no external forces acting. 
Then the potential energy depends only upon the particle separations and 
the quantum mechanical Hamiltonian is 

Clearly rff commutes with the inversion operator, i.e. 
MI - 13^= 0. 
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Functions can be chosen which are simultaneously eigenfunctions of both 
Jf and /. 

In a central field there is a term 

2v(\t,\) 

i 

to be added to the Hamiltonian above (see (1 1 .2)). This Hamiltonian also 
commutes with I and the parity eigenvalues remain good quantum 
numbers. 

The inversion operator also commutes with the angular momentum 
operators of the system. In particular 

and 

£C 2 I-I^ 2 =0. 

For a central field problem (or for the Hamiltonian (11.7) the inversion 
operator, the square of the angular momentum, the z-component of the 
angular momentum and the Hamiltonian form a set of commuting 
operators and can be simultaneously measured. 

Consider a single -particle, hydrogen-type wave-function. In spherical 
polar co-ordinates 

tnlmr R nfi-)PT' ! («>S 0) «""'*. 

The inversion operator corresponds to the transformation 

/■-»■/•, 0->7r-0, -*• n + <t>. 
As 

and 

if '(cos (it - fl)) =P, w '(-cos 0) = (-l) , - ,B 'P, m '(cos 0), 

the wave-function is multiplied by (—1)' when operated on by the 
inversion operator, i.e. 

The parity of the state is independent of the quantum number mi and 
depends only upon /. If/ is an even integer, the state has even parity 
whereas if I is an odd integer the state has odd parity. 

The parity of an n- particle wave-function which can be expressed as 
a product of n single particle central field wave-functions is 

(_l)'. + '7 + - + 'n 

where /,- is the angular momentum quantum number for the z"th particle. 
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The parity concept is useful in considering transition probabilities 
and is particularly useful in classifying nuclear energy states (Theoretical 
Nuclear Physics, by Blatt and Weisskopf). 

The conservation of parity in quantum mechanics has no equivalent 
in classical mechanics as the invariance of the classical Hamiltonian under 
inversion does not lead to any interesting results. 



Spin angular momentum 

The operator representing the spin momentum of an electron is defined 
in terms of the infinitesimal rotation operators, which transform the 
spin-co-ordinates, by equation (1 0.3). The operator representing the z- 
component of the spin momentum of the system of n particles is 



= Z */, (11-8) 



where 



? . =hl(*> 



is the operator representing the z-component of the z'th particle. Similarly 
it is possible to define the operators S x and S y representing the x and y 
components of the spin momentum of the system and the square of the 
angular momentum of the system is 



"op ~" Ax + Sy + "z - 



The spin momentum operators of the system satisfy the same commu- 
tation rules as those for a single particle. 
From (11.8) 



S z =h Zlf? (H.9) 

/ = i 

where ifp is the infinitesimal operator that transforms the spin co-ordinate 
of the zth particle only. The right hand side of (1 1 .9) can be replaced by 
a single infinitesimal operator producing a simultaneous rotation of the 
spin co-ordinates of all the particles. So, spin momentum is conserved if 
the system is invariant under a simultaneous rotation of all the spin 
co-ordinates. This is the case for the approximate free atom Hamiltonian 
(1 1 .2). In fact this Hamiltonian is invariant under a rotation of the ith 
co-ordinate alone and so the spin momentum of each separate particle is 
conserved. 

The argument given in section 9.6 implies that Sop has a discrete set 
of (2S +1) eigenfunctions, 2S an integer, with common eigenvalue 
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S(S + l)fi 2 . Each of these functions may be chosen to be an eigenfunction 
of S z with eigenvalues 

M s h=Sh,(S-\)ft -Sh. (11.10) 

(The reader is warned not to confuse the operator Sop with the quantum 
number S.) For a single electron S = s = \ . 

Consider a system composed of two electrons. If the spin interaction 
terms are ignored, the system Hamiltonian is invariant under a separate 
rotation of each spin co-ordinate and the spin momentum of each elec- 
tron is conserved. For each separate electron s = \ and by following an 
argument similar to that leading to equation (11 .5), the system spin 
quantum number S can take either of the two values 



S = 1 with M s = \,Q,-\ 


(triplet) 


S = with M s = 


(singlet) 


Alternatively, from (1 1 .6) 




£0/2) x £(1/2) =£(1> +Z? (0). 





The four products 

cl{o u )cl{o 2z ), 0(a lz )j3(a 2z ), a(a lz Ma 2z ), j3(a u )a(a 2z ) 

where o lz , a 2z are the spin co-ordinates of the two electrons, form the 
basis for the reducible representation D^P) x D^ 2 \ From these it is 
possible to construct the three functions 

spin quantum number 
S M s 

a(o lz )a(o 2z ) 1 1 

P(°uM°2 Z ) 1 -1 

-^[a(ff lz ))3(a 2z )+j3(a lz )a(a 22 )] 1 

which transform according to D^ and the single function 

S M s 

-L[a(a lz )/J(a 2z )-/J(a lz )a(a 2r )] 

which transforms according to Z)(°). All of these functions are ortho- 
normal. From the spin operator viewpoint, each of the three functions 
that transforms according to D^ is simultaneously an eigenfunction of 
Sop with eigenvalue 2ft 2 and also an eigenfunction of S z with eigenvalues 
h,0.—h respectively. The single function that transforms according to 
Z>(°> is an eigenfunction of sf p and S z with common eigenvalue zero. 
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The reader is advised to verify the results using equations (10.17), 
(10.18) and noting that (from, (1 1.8)) 

Si p = (s 2 i x + s\ y + »?,) + (si* + sly + s\ z ) 

+ 2(s lx s 2x +s ly s 2y + s lz s 2z ). 

Total angular momentum 

The total angular momentum of the system is defined to be the sum of 
the orbital angular momentum and the spin angular momentum of the 
system. In particular 

J t =X t + S t (11.11) 

and similarly for J x and J y . The square of the total angular momentum is 

J 2 =J 2 +J 2 +j! (11.12) 

and these operators satisfy the usual commutation rules. 
From (11.4), (11.8) and (11.11) 

'«=» i(4 r) +^) s » i 4 r) (n.13) 

i = 1 1=1 

where ifp is an infinitesimal operator producing a simultaneous rotation 
of both the space and spin co-ordinates of the ith particle. The right hand 
side of (1 1 .1 3) can be replaced by a single infinitesimal operator pro- 
ducing a simultaneous rotation of all the co-ordinates of all the particles. 
Consequently, total spin momentum is conserved if the system is 
invariant under a simultaneous rotation of all the co-ordinates, both 
space and spin. This is the case for the general free atom Hamiltonian 
even when spin-orbit coupling is included. 

The argument of section 9.6 implies that J 2 has a discrete set of 
(2 J +1) eigenfunctions, 2/ an integer, with common eigenvalue /(/ + Yfi 2 
Each of these may be chosen to be an eigenfunction of J z with separate 
eigenvalues 

Mjh=Jh,(J-l)ti,...,-Jh. (11-14) 

The addition of the total angular momenta of the particles in a system 
follows along the same lines as that leading to equations (1 1 .5) and 
(11.6). 



11.3 Identical Particles 

In quantum mechanics the uncertainty principle makes it impossible to 
define precisely a particle trajectory. Because of this a given particle, in a 
collection of identical particles, cannot be labelled. This is not the case 
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in classical mechanics. A classical particle has a definite trajectory and it 
is possible to distinguish individual particles by following their paths. 
So, in quantum mechanics, unlike classical mechanics, there is, in 
principle, no physical way of distinguishing between identical particles. 
This can be formalized by introducing permutation operators. A permu- 
tation operator Py interchanges the labels of the i and / particles. In 
particular 

Pi2^(ji,o lz ;t2,02z) = H^,o 2 z;r l ,a u ) (11.15) 

where r,-, a iz are the space and spin co-ordinates of the ith particle. As 
identical particles are indistinguishable it is not possible to observe any 
change in a system if two such particles are interchanged. This means 
that if particles are relabelled, a normalized function must only change 
by a phase factor, i.e. 

Pi2Hri,Oiz;r2>02z) = e ict HruO l2 ;r2,o 2z ). (11.16) 

If the interchange is repeated the original wave-function must be 
obtained and so P n has two eigenvalues given by (compare the 
inversion operator), 

e i2a = +l 
i.e. 

e ia =±l. (11.17) 

The quantum mechanical principle of indistinguishability requires 
that a many-particle wave-function behaves in one of two ways when 
two particle are interchanged (i) either the wave-function is unchanged, 
i.e. symmetric, or (ii) the wave-function changes sign, i.e. antisymmetric. 
For consistency, it is clear that a wave-function describing a system with 
three or more identical particles cannot be symmetric with respect to 
some interchanges and antisymmetric with respect to others. The wave- 
function is either symmetric or antisymmetric with respect to all 
interchanges. 

The Hamiltonian describing a system of identical particles must be 
invariant when the particle co-ordinates including spin are relabelled. 
The permutation operators must commute with the Hamiltonian and 
the permutation quantum numbers (1 1 .17) are good quantum numbers. 
So the wave-function is either permanantly symmetric or it is permanently 
antisymmetric, which of these being determined by the nature of the 
particles rather than the state of the system. 

Particles described by symmetric wave-functions are said to satisfy 
Bose-Einstein statistics whereas particles described by antisymmetric 
wave-functions satisfy Fermi-Dirac statistics. It can be shown from 
relativistic quantum theory that particles with integral spin are Bose- 
Einstein particles (Bosons), and that particles with half-integer spin 
(e.g. electrons, protons, neutrons) obey the Fermi-Dirac statistics. 
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It is easily shown that two Fermi particles cannot occupy the same 
position in space and also have the same spin. Consider a wave- function 
describing a system of Fermi particles. If two of the particles are inter- 
changed the wave-function must change sign. However if these two 
particles have the same set of four co-ordinates (space and spin) then 
the wave-function is clearly invariant. 

This implies that the wave-function vanishes when two particles 
have the same set of co-ordinates indicating that this state does not exist. 
This is one form of the Pauli exclusion principle. 

If the relativistic spin dependent terms in the Hamiltonian are ignored 
the wave-function can be written as a product of a function depending 
on the space co-ordinates multiplied by a function of the spin co- 
ordinates. Consider the case of two Fermi particles. The product wave- 
function is 

^(ri,Oiz,r 2 , a 2z ) = <£0i> r 2 )x(oi z , o 2z ). (1 1.18) 

As the complete wave-function must be antisymmetric in the co-ordinates 
(including spin) then 0(r x , r 2 ) must be either symmetric in the space 
co-ordinates, in which case x{°\z, °iz) is antisymmetric in the spin 
co-ordinates or vice-versa. Then the wave-function (1 1 .18) is a simulta- 
neous eigenfunction of three commuting operators, the Hamiltonian, the 
particle interchange operator and the operator that interchanges the 
spin co-ordinates only. The system spin operators Sl p and S z commute 
with the three mentioned above and so the wave-function can also be 
chosen to be a simultaneous eigenfunction of all five operators. These 
wave-functions are the correct zero order functions for a perturbation 
treatment regarding the spin terms as a perturbation. 

Non-interacting particles 

Consider a system composed of n identical particles whose mutual 
interaction can be ignored. The Hamiltonian ^"of the system is the sum 
of n equal Hamiltonians for the individual particles, i.e. 

n 

JC = 2* **i 

i=l 



with 



h 2 



•&, = -— Vf + V(r h o i2 ) 
2m 

where V? operates on the position co-ordinates of the z'th particle at r,-. 
The potential Kis the same for each particle. 

As the individual particle Hamiltonians <#J all commute with each 
other and with Jf, wave-functions can be found that are simultaneously 
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eigenfunctions of jf and all the .#/. These are the variable separable 
eigenfunctions 

\p(r u a lz ; r 2 , o 2z , . . .; r„, a nz ) = a (r^ a l2 ) . . . <pQ„(t„, a nz ) 

(11.19) 



where 



^i«a(ri, °\z) = E Qi 4> Qi {x u a lz ) 



and 



#ty=Ety with £"=2 %•• 

i=i 

The subscripts Gi, ■ • •> Gn indicate the completely defined (including 
spin orientation) single particle energy states occupied. If the co- 
ordinates of any two particles are interchanged the wave-function 
(11.19) remains an eigenfunction of the system Hamiltonian with the 
same energy. As there are n! permutations of n objects then n\ such 
eigenfunctions can be generated. Note that none of these eigenfunctions 
is in general symmetric or antisymmetric with respect to particle inter- 
changes. However the sum of all n ! functions is clearly symmetric. If the 
particles are electrons (i.e. Fermi particles) the many electron wave- 
function must be antisymmetric. Such a wave-function is obtained by 
forming the sum of all wave-functions obtained from (1 1.19) by an even 
number of particle interchanges and substracting the sum of all wave- 
functions obtained by an odd number of interchanges. This antisymmetric 
wave-function can be expressed as the determinant 

Hri,Oi z ;...;r„, a nz ) 



1 



\fn\ 



4>Q l 0"l . °lz) 0g, (>"2» 2 z) ■■■<i>Q l ( r n> a nz) 
<PQ 2 ( T 1> °lz) 0Q 2 ( r 2. °2z) ■ ■ ■ <t>Q 2 (tn, °nz) 



<l>Q n (*U °\z) ■ ■ ■ 



0Q„('n. <?„ 2 ) 



(U.20) 



If the single particle wave-functions are normalized (and orthogonal) then 
so is i//. Clearly if the co-ordinates of any two particles are interchanged 
the determinental wave-function (1 1 .20) changes spin. (Note a deter- 
minant changes sign when two rows or two columns are interchanged.) 
Note that if two of the occupied one-electron states are the same the 
determinant (1 1 .20) has two rows equal and so vanishes. The system 
Hamiltonian has no solution for which the completely defined one 
electron states are occupied by more than one electron. This is of course 
the Pauli exclusion principle for non-interacting particles and helps to 
explain the periodic system of the elements. At most there can be two 
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electrons in each hydrogen type atomic orbit and they must have 
opposite spins. 
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The Helium atom 

In section 7.2 the ground state of the Helium atom was discussed using 
the variational method. The ground state and lowest excited states will 
now be investigated using first-order perturbation theory. The Hamiltonian 
operator for the Helium atom, ignoring spin dependent terms and the 
translational energy, is 

» 2 e 2 

■+- 



= -— (v?+v3)- 

2m 



7e l 



2e> 



47re ri 4Tie r 2 47re |ri -r 2 l 



(7.17) 



where the two electrons are at r t and r 2 respectively. The spin momenta 
operators of each particle and of the system all commute with Jf and 
the total spin quantum number can only take the values S = or S = 1 
corresponding to antiparallel and parallel spins respectively (section 1 1 .2). 
It will be shown below that even when spin dependent terms in the 
Hamiltonian are ignored, the allowed energies of the system do depend 
upon the total spin quantum number. This 'exchange' effect arises 
because of the use of a determinantal wave-function. 

If the electron-electron term in (7.17) is ignored the exact ground 
state occurs when the two electrons with antiparallel spins occupy the 
hydrogen type (Is) state (Pauli principle). This is the atomic configuration 
(Is) 2 and the wave-function is 



<K*i, o lz ;t 2 ,a 2z ) = 



0is(ri)a( a iz) 0is(r2M<*2z) 

0i,(*OPO*i») *i»('a)/J(02«) 
1 

-j3(a lz )e*(a 2z )] 



V5i 

*l*('l)01i('2) 



■ [a(o u )j3(a 2z ) 



(11.21) 



where 0u(r) is the hydrogen type wave-function 

The spacial part of the determinantal wave-function is symmetric 
whereas the spin part is antisymmetric. The operators Sop and S z have 
zero eigenvalues as the electrons have antiparallel spins and the orbital 
angular momentum number is L = 0. (Note, the operator representing 
the orbital angular momentum of the system commutes with the 
Hamiltonian (7.17).) All states with a given L and 5 form a 'term'. The 
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wave- function (1 1 .2 1 ) represents the 'singlet' term 1 S. The S being the 
spectroscopic notation for a state with L = and the superscript 
represents the spin degeneracy (25 + 1) of the state. (It is hoped the 
reader will not confuse the two distinct meanings given to 5.) 

The nondegenerate determinantal wave-function (1 1 .21) can be used 
to give the energy of the ground state of the Hamiltonian (7.17) correct 
to first order, with the electron-electron interaction term regarded as a 
perturbation, i.e. 

E = 2E(\s) 

+ \\^\s(ji)4>Xs(j2)- A j [ 0u(ri)0i s (r 2 ) drj dr 2 

JJ 47re |r 1 -r 2 | 

where 



E(ls) = j<t>Uri) 



2m 4ne ri 



0i s (ri)dri 



is the energy of a single electron in the (Is) state assuming no other 
electrons present. The first terms represents the kinetic energy of the two 
electrons together with their potential energy in the field of the nucleus. 
The second term represents the electrostatic interaction between the 
electrons. The value of the ground state energy obtained in this way is 
E = — 2-75e 2 /47re a and is not as good as that obtained in section 7.2 
using the variational method. 

The more important excited configurations occur when one of the 
electrons is excited into a hydrogen type level with n = 2 to produce the 
configurations (ls)(2s) or (ls)(2p). It is not difficult to show that matrix 
elements between these two types of excited configurations vanish so 
that the (ls)(2p) states can be dealt with separately. Consider the 
(ls)(2s) configuration. There are four possible determinantal wave- 
functions depending upon the spin orientations of the two electrons. 
By suitably combining two of these determinantal functions, the four 
possible antisymmetric wave-functions can each be expressed as a 
product of a spatial function multiplied by a spin function and can 
be divided into two classes depending upon whether the spin part is 
symmetric or antisymmetric. These are the correct zeroth-order wave- 
functions for a perturbation treatment of the spin-orbit coupling. 

One of the wave-functions has an antisymmetric spin part (and 
hence symmetric spatial part) and corresponds to the singlet term 1 S 
with L=0 and 5 = 0. 

x — [a(o lz )p(o 2z ) - /3(a lz )a(a 2z )] . 
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The remaining three wave-functions have a symmetric spin part and 
correspond to the triplet term 3 5 with L = and 5=1. 

<h =—jE [0i*(ri)02 S (>2) - <t>2s(ri)<t>u(r 2 )] 
x — [a(a lz )j3(a 2z ) + /3(a lz )a(a 2z )] 

>h =- y=" [01s(*l)02s('2) - 02s(ri)0u(r2)l a ( o iz)a(CT2z) 
04 = — ^ [0u('i)02*(r2) - 02 i (r 1 )0 ls (r 2 )]^(ffi z )/3(a 2z ). 

The last three wave-functions all have the same antisymmetric spatial 
part, and vanish if the electrons occupy the same point in space. The 
electrons tend to keep apart in this case and the positive contribution to 
the energy arising from the repulsive electron-electron interaction term 
will be smaller than that arising from the symmetric spatial part of the 
term 1 S. So, it is to be expected that the perturbed 3 5 triplet term will 
lie deeper than the 1 S term. This will be shown to be the case. 

When the electron-electron interaction term is ignored all four states 
(singlet and triplet) are degenerate. Although the unperturbed states are 
degenerate, the perturbation matrix is already diagonal with the above 
choice for the excited wave-functions. To the first order, the perturbed 
energy of the excited singlet state 1 S is the expectation value of (7.17) 
using i and is 

ECS)=E(ls)+E(2s) + J + K (11.22) 

where E(ls) and E(2s) are the energies of a single electron in the (Is) and 
(2s) states respectively and 

2 

J= ff0*s(ri)0a»(r2)^ j f 0u(ri)02*('2)dT l dr 2 

K= [j 0u(rO0iL(r 2 )- [0i i (r 2 )0 2s (ri)dT 1 dr 2 . 

/ is called the Coulomb integral and represents the average Coulomb 
energy of the two electrons in the (Is) and (2s) states respectively. K is 
called the exchange integral and is a direct consequence of using a deter- 
minantal wave-function instead of a simple product. Unlike / it does not 
have an electrostatic interaction explanation and arises as identical 
particles cannot be distinguished. 
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The perturbed energy of the excited triplet term 3 S is 

E( 3 S)=E(\s)+E(2s) +J-K. (11.23) 

Note that the triplet degeneracy of this term is not removed by the 
electron-electron interaction. The difference between the *S and 3 S 
terms is 2K and as it can be shown that K > 0, the triplet terms lies 
deeper than the singlet term as expected. 




, Perturbed 
levels 



Unperturbed level 
4-fold degenerate. 

E - E(U)+E(2s) 



Figure 11.1 Displacement and splitting of the configuration (1 s) (2s) by the 
electron-electron interaction. 



Actually this first-order perturbation treatment is not very good as 
the electron-electron interaction term is too large. However, from 
symmetry considerations, the quantum numbers L and 5 are still good 
in an exact treatment and the 3 5 term is still triply degenerate. When 
the spin-orbit coupling term is introduced, the Hamiltonian is no longer 
invariant under a rotation of the space or spin co-ordinates separately 
and L and S are no longer good quantum numbers. However, it is 
invariant under a simultaneous rotation of all co-ordinates and the 
states can be classified by the total angular momentum quantum 
number /. 



The hydrogen molecule 

Even though this is the simplest molecule involving more than one 
electron no exact solution exists. The Hamiltonian for the molecule, 
ignoring spin dependent terms and taking the nuclei to be fixed, is 
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where r la , r lb are the distances of the first electron from the protons 
which are at a and b and similarly r^ , r 2 b are the distances of the second 
electron from the protons. The electron separation is r 12 and the protons 
are a distance R apart. 




l +e 



Figure 11.2 The hydrogen molecule. 



In Chapter 7 the molecular orbital method was applied to the hydrogen 
molecular ion and it was shown that the ground state orbital is the 
gerade, g state 

V * V2(l+A) 

where 01, 02 are the normalized (Is) wave-functions centred on the two 
protons and A is the overlap integral. This represents a bonding state in 
which electron charge is transferred to the bond. This molecular orbital 
corresponds to a binding energy for Hj of ~ 1 -8 eV compared with the 
observed binding energy of 2-8 eV. (Binding energy of a molecule = 
ground state energy of isolated atoms — ground state energy of 
molecule.) 

Clearly this simple model is not very good although it does illustrate 
the bond concept. Electron charge is transferred to the space between 
the atoms and holds them together. Confining an electron in a bond will 
increase its kinetic energy and binding will only result if the decrease in 
potential energy is greater than the increase in kinetic energy. 

The hydrogen ion bond orbital can take up another electron with 
opposite spin and the energy of the electron pair in the bonding orbital 
will be lower than that of the two isolated hydrogen atoms and binding 
energy will result. In this crude model, the two-electron wave-function 
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for the hydrogen molecule is (c.f. (1 1 .21)) 



^g(ri,ai z ;t 2 ,a 2z ) = —i= 



= ^(ri)^(r 2 ) —= [a(a lz )j3(a 2z ) 

-P(°lz)<x(°2z)]- 

The symmetric spatial part may be expanded to give 

^('i)^('a)*^7j^j[^i('i)^i(»2) + *a('i)*a('2) 

+ »/'i(«-i)V'2(r2) + ^i(»-2)'/'2(ri)]- 

This wave-function gives equal weight to the ionic states in which the 
two electrons are on the same atom and the remaining states in which 
each atom has one electron. Electrons will repel each other and this 
suggests that the ionic states should not be as important as this. This 
correlation may be taken into account by configuration interaction in 
which the two-electron wave-function is expressed as a linear combination 
of determinants. 

The lowest determinant that will mix in is that which is obtained by 
exciting both electrons to the antibonding orbital ty u (r). The spin part 
of the two-electron wave-function i/>„(ri, a lz ; r 2 , a 2z ) is antisymmetric 
as above and the spatial part is 

'friifriWiifa) = 2(1 _ A) [^i^i)^i(r 2 ) + ^2(Ti)^ 2 (i 2 ) 

- <Pi(ti)^i(t 2 ) - ^i(r 2 )i/'2('i)]- 

By taking a suitable combination of the two determinantal wave- 
functions the ionic terms may be reduced or eliminated. The energy 
may be is minimized in this way and the resulting combination is such 
that ionic terms are relatively unimportant. 

It would appear that a better first wave-function would be to take the 
electrons on different atoms to begin with. This is the basis of the Heitler- 
London approach. There are two possible functions to consider. 

(i) There is the singlet state with symmetric spatial part (5 = 0) 

1 



V2(l +A 2 ) 



[^i(Xi)\p 2 (T 2 ) + \p 2 (T l )\jJ 1 (t 2 )] 
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(ii) and also the triplet state with antisymmetric spatial part (5=1) 

1 



V20-A 2 ) 



WiCri^fa)- M'lWifa)]- 



(The 2 denotes zero orbital angular momentum about the molecular 
axis.) 

Detailed analysis indicates (problem 8) that the singlet state lies 
deepest and so represents the ground state. This is the opposite situation 
to that occurring in helium. However it is to be expected that the strongest 
binding for hydrogen will occur when the electrons tend to accumulate 
between the protons. This will occur when the electrons can occupy the 
same space state and hence have antiparallel spins. 

Some electron correlation is already built into this method and the 
energy can be minimized by varying parameters in i^i and 4>2- This 
method is obviously correct in the limit of infinite separation of the 
atoms. 

The Heitler-London wave-function and the best two-determinant 
molecular wave-functions are of equivalent accuracy and both involve 
an error of about 0-5 eV in the calculated binding energy. The observed 
binding energy of the hydrogen molecule is 4-7 eV. For more details the 
reader is referred to the article by Coulson and Lewis in Quantum Theory, 
Vol. II, edited by Bates. 



Rotational spectra of homonuclear molecules 

The previous discussions have concerned the identity of electrons. A 
homonuclear diatomic molecule (e.g. H 2 ) contains another pair of iden- 
tical particles these being the two nuclei. The complete wave-function 
describing the system of electrons and nuclei must be either symmetric 
or antisymmetric under an interchange of nuclei depending on whether 
they have integral nuclei spin or half -integral nuclear spin respectively. 
The proton is a Fermi particle and so the hydrogen molecular wave- 
function, including nuclear terms, must be antisymmetric under an inter- 
change of the nuclei. The nucleus in (^0)2 has zero spin and so the 
complete wave-function must be symmetric under an interchange of 
nuclei. 

Using the Born-Oppenheimer approximation the molecular wave- 
function may be written 

\p = ^(el)\|/(nuc) 

where i//(el) describes the electron motion relative to fixed nuclei (this is 
the molecular wave-function discussed in the previous section), and i//(nuc) 
describes the nuclear motion. 
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Further, the nuclear wave-function may be written (ignoring nuclear- 
spin-molecular coupling) 

i//(nuc) = t//(vib)i|/(rot)i//(sp) 

where \[/(vib) describes the vibrational motion of the nuclei, \l/(rot) 
describes the rotational motion and vK s P) depends on the nuclear spin. 
The vibrational function depends only upon the separation of the nuclei 
(4.30) and is unchanged by interchanging the nuclei. The rotational 
function can be written (Chapter 4) (c.f. hydrogen atom functions) 

iKrot) =Pk(cos d)e im,t > K = 0,1,2,.... 

Interchanging the nuclei corresponds to 

and from the discussion of parity earlier in this chapter it is clear that 
i//(rot) is symmetric (anti-symmetric) if K is even (odd). 

Consider the hydrogen molecule. The protons are Fermi particles 
with spin £ and the complete wave-function must be antisymmetric 
with respect to interchange of the protons. The analysis given earlier for 
the electron spin functions of helium applies and there is one anti- 
symmetric nuclear spin function (5 = 0) and three symmetric spin 
functions (5 = 1). The normal ground state of i^(el) is symmetric with 
respect to proton interchange and because of the nuclear spin degeneracy 
there are three times as many odd rotational states as even. At moderate 
and high temperatures in thermal equilibrium three times as many 
molecules will be in odd as in even rotational states. 

If coupling terms between the nuclear spin and the molecule are 
ignored then nuclear spin is a constant of the motion. In fact these 
coupling terms are extremely weak and so a molecule with anti- 
symmetric spin is very unlikely to undergo a transition to a state with 
symmetric spin. The hydrogen molecule may be regarded as composed 
of two independent parts, one with zero nuclear spin (para-hydrogen) 
and the other with spin one (ortho-hydrogen). At normal temperatures 
hydrogen is one-quarter para and three-quarters ortho-hydrogen. Hence 
in a transition between two electron states the rotational fine structure 
of hydrogen shows a 3 : 1 alternation of intensities. The selection rule 
SK = ±1 (applicable for 2 electron states) prevents any transitions 
between electronic states with the same symmetry and explains the 
general lack of infra-red absorption spectra for homonuclear diatomic 
molecules. 

In general, for a homonuclear diatomic molecule, where nuclei have 
spin 7, there are (/ + 1 )(2I + 1 ) symmetric spin states and 1(21 + 1 ) anti- 
symmetric spin states. The intensities of alternate lines in the rotational 
fine structure will then vary as (I + 1): I. Measurements of rotational 
intensities have been used to determine nuclear spins. 
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For a molecule with zero nuclear spin, the nuclei are bosons and the 
total wave-function must be symmetric under an interchange of nuclei. 
The nuclear spin part of the wave-function is a constant and so if i//(el) 
is symmetric (antisymmetric) the only allowed states are those with K 
even (odd). In transitions between two electronic states of different 
symmetry, alternate lines in the rotational fine structure will be missing 
because of the selection rule for K. 

If the two nuclei composing the molecule are different isotopes of the 
same element then they are no longer identical and the above discussion 
does not apply. 



11.4 Atomic Energy Leveb 

It is very difficult to solve problems involving more than one electron 
and complicated numerical procedures are necessary even in the simplest 
case. The difficulties arise because the electron-electron interaction 
contains terms of the form e 2 /47re lr,- — f/ 1 involving the spatial co- 
ordinates of two electrons. However there are approximate methods for 
reducing the many electron problem to a one-electron self -consistent 
field problem. 

Hartree 's self-consistent field 

Ignoring the translational energy and the spin-orbit interaction terms, the 
Hamiltonian operator for a free atom of atomic number Z with n 
electron is 



- 1 



i -- 



h 2 , Ze 2 



•v; 



2m 4ne ri 



. n n 
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+ ■ 
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(<*/) (11.2) 



The first term represents the kinetic energy of the electrons and the 
second term represents the attraction between the nucleus and the elec- 
trons. The third represents the electron -electron interaction, the sum- 
mation being over all pairs of electrons. 

The Hartree equations may be obtained by assuming a product wave- 
function of the form 

iKr,, r 2 , . ■ ., r„) = <t> Ql (ri)tf> e2 (r 2 ) . . . tf> G „(r„). (1 1 .24) 

It is essential that the one-electron functions be normalizable. Of course 
(1 1 .24) is not antisymmetric with respect to interchange of any two 
electrons but the single electron states are chosen in agreement with the 
Pauli exclusion principle in that each completely defined occupied state 
occurs once only. 
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The best approximation to the ground state by a solution of the form 
(1 1 .24) is obtained by minimizing the expectation value of the 
Hamiltonian. So, the best Hartree product solution is that which 
minimizes the integral 

J . . . J ^*(r 1; r 2 , . . ., t„)j^\I/(t u i 2 , . . ., r„) dr, dr 2 . . . dr„ 

(11.25) 
subject to the restrictive conditions 

J 0e,-Oi)0 ei .Oi) dr, = 1 / = 1 , 2, . . ., it (11 .26) 

The method of Lagrange multipliers may be used. Consider the 
function 



l ST* XT* £ 



^a(ri)- ••*(?„ (fn)^ ...d 



where 



2ff2 



Ze 2 
47re r,- ' 



and X/, /= 1 ,...,« are the Lagrange multipliers. 

Suppose each of the one-electron functions <t>Qi(ti) undergoes a small 
change 84>Q^tf). For a stationary value of the integral (1 1 .25) subject to 
the conditions (1 1 .26) it is necessary that 
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<t>Q k (ric) ■ ■ ■ <t>Q n (*n) dri . . . dr„ + _ 2 */ 
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j<t>Q k (*k)54>Q k (rk)&T k 



i.e. 



+ j 5<t>Q k (r k )<t>Q k (i k ) dr fc 
i',/*k 

V e 2 fl*a( r /)l 2 

+ / 1 ■ i dT/ + X& 



0Q k ( r fc) dr * 



+ (a corresponding expression in 84>Q k (r k )) = 0. 



(11.27) 



&<t>Q k and 8<j>Q k may be regarded as independent variations and the 
coefficients of these in (1 1 .27) must be zero . The coefficient of &.<t>Q k is 

„2 H^fc.MS 



= £=1,2 « 






(11.28) 



where 



<?*=- 2 f0e,(r / )^/0 e/ (r / )dT / 
i # it - ' 

2 %f rf4jre oJ lrj-r/1 ; 



i,j*k 



(11.29) 



Note that e k is real and that the coefficient ofS4>Q k is simply the 
complex conjugate of (1 1 .28). (Note that the operators Ai are 
Hermitian.) 

In conclusion, the best Hartree product, of the type (1 1 .24), is 
composed of normalized one-electron functions that satisfy the n 
Hartree equations (1 1 .28) with k = 1 , 2, . . ., n. The one-electron 
function 4>Q k (ii) is an eigenfunction of the Hamiltonian 

|2 
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with eigenvalue e k - Each of these terms may be given a physical signifi- 
cance. The first term represents the kinetic energy of the Arth state 
electron, the second terms represents the interaction between the electron 
and the nucleus and the final term represents the Coulomb energy of the 
electron in the average field produced by all the other electrons. The 
Hamiltonian is not the same for all electrons and the eigenf unctions are 
not orthogonal. 

The set of n non-linear integro-differential equations (1 1 .28) is too 
difficult to solve directly. Hartree (1 928) suggested a self-consistent 
approximation. An initial set of one-electron functions is assumed and 
used to obtain the Hamiltonians (1 1 .30). The Hartree equations (1 1 .28) 
are then solved numerically and a new set of one-electron functions 
obtained. This process is continued until the functions are self-consistent 
to the required order of accuracy. In practice the third term in the 
Hamiltonian (1 1 .30) is approximated by a spherically symmetric term 
by averaging over all directions and the one-electron functions can then 
be characterized by / and m t quantum numbers. 



The Hartree-Fock self-consistent field 

The variational method used above was suggested independently by Fock 
and Slater in 1930 after Hatree obtained the equation (1 1 .28) directly by 
physical arguments. 

The simple product wave-function (1 1 .24) is not antisymmetric with 
respect to interchange of electrons. The Hartree-Fock method starts by 
assuming the wave-function to be the antisymmetric determinant 



0& ( r i' a iz) • • • ( t>Q l ( r n, °nz) 

4>q„(!lOiz) 4>Q„{r„, a nz ) 

(11.31) 



iHr 1 ,o lz ;...;r nz o„ z )=— y= 
y/n) 



The expectation value of the Hamiltonian (1 1.2) obtained using the wave- 
function (1 1 .31), is minimized subject to the condition that each one- 
electron function is normalized. These one-electron functions must be 
linearly independent so that it is possible to form an orthogonal set from 
them (Adams, 1961). 

The set of equations which the spatial parts of the one-electron 
functions satisfy can be written in the 'standard form' 

3«' k <l>Q k (j l ) = e k <p Qk (t l ) * = l,2,...,7i (11.32a) 
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where the one-electron Hamiltonians are given by 
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(11.32b) 



The first two terms are just the first two terms of Hartree's- 
Hamiltonian (1 1 .30). The third term is summed over all the ground state 
wave-functions including the kth and represents the Coulomb energy of 
the kth. state electron in the average field of all the electrons including 
itself. This Coulomb interaction of an electron with itself is clearly not 
acceptable and must be compensated somehow by the fourth term. This 
new 'exchange' term is summed over all wave-functions with spin parallel 
to the kth. In fact if only the element in this summation with/ = k is 
used, the Hartree-Fock and Hartree-Hamiltonians are identical. 

Of course, the equations (1 1 .32a) have an infinite number of solutions. 
There are the n one-electron wave-functions which are occupied in the 
ground state and also an infinite number of solutions corresponding to 
unoccupied excited states. All the Hartree-Fock solutions are orthogonal 
to each other. This is not the case for the Hartree solutions. 

The total energy of the system is the expectation value of the 
Hamiltonian (1 1 .2) obtained using the determinantal wave-function 
(1 1 .31) and is less than the sum of single particle energies 2 e k as this 
sum counts each electron-electron interaction twice. However the mag- 
nitude of the single particle energies are approximately equal to the 
ionization energies when only one particle is excited. 

It is to be expected that as an electron moves it will repel other 
electrons and a 'coulomb hole' will be formed about it. There is corre- 
lation between the motion of the electrons. The Hartree-Fock wave- 
function does take some account of correlation between electrons with 
parallel spins but it ignores correlation between electrons of unlike spin. 
Of course correlation reduces the energy of the system by keeping 
electrons apart and the Hartree-Fock method is a better approximation 
than the Hartree method which completely ignores correlation. The 
difference between the energy of the system obtained using the Hartree 
and the Hartree-Fock methods is called the 'exchange energy'. The 
reader is warned not to give too much physical significance to this as it is 
simply the difference between two mathematical approximations. The 
Hartree-Fock energy E Hr is an upper bound to the exact ground state 
energy E. The energy difference (E — #h.f.) is called the 'correlation 
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energy' and is a significant measure of the accuracy of the Hartree-Fock 
method. The correlation energies for atoms are of the order of a few 
electron volts. 

In practice the Hartree-Fock equations are solved numerically by 
iteration until self -consistent solutions are obtained. 

The infinite set of solutions of the Hartree-Fock equations forms a 
complete set of one-electron functions. The infinite set of n x n deter- 
minantal wave-functions constructed by taking different combinations 
of n Hartree-Fock solutions form a complete set of n-electron anti- 
symmetric wave-functions. The exact solutions of Schrodinger's many- 
electron equation may be expressed as linear combinations of these 
determinantal wave-functions. However in many cases a single determinant 
is fairly reasonable. 

The periodic table and the shell model 

Usually the coulomb and exchange terms in (1 1 .32) are averaged over all 
directions to make the n spherically symmetric and the one-electron 
functions are then characterized by / and mi angular momentum quantum 
numbers as well as by a principle quantum number n. (The number of 
nodes, excluding the origin, of the radial part of the one-electron function 
is equal ton - I - 1 .) 

The one-electron states are denoted by 

(Is) when n = l, 1 = 
(2s) when n = 2, 1 = 
(2p) when «=2, 1=1 



with the same notation used for hydrogen type states. In this general 
(not coulomb) central field approximation the energy of the zth electron 
depends upon both n t and /,-. 

Suppose the Hartree-Fock equations (1 1 .32) are equivalent to a 
variable separable, central field Hamiltonian for the system with the 
form 






2m 



(11.33) 



where «,(r,) are suitable one-electron potentials. (By further approxi- 
mating the exchange terms in (1 1 .32) Slater (1951) has been able to 
show this.) Then the determinental wave-function formed by solutions 
of the Hartree-Fock equations are eigenfunctions of the central field 
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Hamiltonian Jf - This Hamiltonian is invariant under separate rotations 
of the co-ordinates of each particle and so commutes with the separate 
particle angular momentum operators. Its eigenvalues depend only upon 
the set of quantum numbers («,-, /,•), i = 1 , 2, . . ., n associated with the 
one-electron functions in the determinantal eigenfunctions. The energy 
is independent of the quantum numbers ma and m S i as there are no 
spin-orbit interaction terms. 

This central field 'atomic' energy level is called a configuration 
(sometimes 'configuration' refers to the set of n associated one-electron 
functions) and its degeneracy is less than or equal to 

2" ]1 (2/, + l) (11.34) 

i = i 

as each one-electron energy has degeneracy 2(2// + 1). In general the 
Pauli principle prevents the degeneracy (1 1 .34) from being attained and 
also limits the configurations allowed. 

A central field single electron state is completely defined by the set of 
four quantum numbers (n, /, mi, m s ). Electrons are Fermi particles and 
the antisymmetry of the many-particle wave-function demands that only 
one electron can occupy a completely defined state. This is the Pauli 
exclusion principle. The combination of the Pauli principle with the 
Hartree-Fock method gives an explanation of the periodic system of 
elements. 

In the ground state the one-electron atomic levels of lowest energy 
are filled up first, the Pauli principle preventing all electrons from 
occupying the lowest (Is) state. The chemical properties are mainly 
determined by the 'valence' electrons in the shell (defined by n, I) of 
greatest energy. If the outermost shell is completely filled and there is a 
considerable energy gap to the next empty shell then it is to be expected 
that the element will be inert. The rare gases are typical examples. 

Atomic hydrogen has the configuration (Is) in the ground state. The 
next atom, helium, has both electrons in this shell and the configuration 
is denoted by (Is) 2 . This is a closed shell and helium is inert. The next 
element, lithium, has the configuration (ls) 2 (2s) and has one electron 
outside a closed shell. Such elements with one electron in an s-shell 
(apart from hydrogen) are chemically similar and these are the alkalis. 
Atoms with a full p-shell (or (ls)-shell) are inert and are the rare gases. 

The configuration of atoms with more and more electrons can be 
obtained in this way and imply periodicities in the chemical properties 
of the elements. The number of electrons in a completely filled shell 
depends only upon the /-value and is given by 2 (2/ + 1 ). For the s, p, d 
states the number of electrons is 2, 6, 10, respectively. 

Complications do arise because the one-electron energies depend upon 
both n and /. For example, the shells (4s) and (3d) have similar energy 
and they may not be filled up in sequence. Another transition period 
occurs because of the similarity in energy of the shells (5 s) and (4d). 
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The nuclear shell model is an attempt to explain nuclear properties in 
a similar manner. An atomic nucleus is composed of protons and neutrons 
both of which are Fermi particles with spin 5 and with similar masses. 
These nucleons interact through a short range (~10~ 1S m) attractive 
nuclear force. (The protons of course also repel each other by the long 
range coulomb force.) The shell model assumes that each nucleon moves 
in an averaged field due to the other nucleons. For many nuclei, this 
field can be taken to be spherically symmetric and this means that each 
nucleon state can be classified by (n, I) quantum numbers. However 
spin-orbit coupling is relatively important and the energy levels also 
depend upon, to a large extent upon/. States of similar energy may be 
grouped together and called nucleon shells but the numbers of allowed 
nucleons in the shells are different from the electrons because of the 
strong spin-orbit coupling. The neutrons and the protons form their own 
shells but the energy difference is very small for the lighter nuclei. The 
three lowest shells can contain up to 2, 6 or 12 protons or neutrons 
respectively. A nucleus containing 2, 8 or 20 . . . protons (or neutrons) 
will have completed proton (neutron) shells. These are the 'magic 
numbers'. A nucleus with completed proton and neutron shells may be 
expected to be stable. The isotope ^He is a typical example. The nuclear 
shell model has proved useful in considering nuclear properties. 

Electrostatic interaction and terms 

The atomic configuration degeneracy described above does not really 
occur as the averaged central field potential used in the Hartree-Fock 
equations (and hence in Jf? ) is only an approximation to the actual 
electrostatic interaction between the electrons. The difference between 
the two Hamiltonians is (from (1 1 .2) and (1 1 .33)) 
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i u,(rj) (11.35) 



and may be regarded as a perturbation. This is a degenerate perturbation 
problem and the correct zeroth order functions must be used. The 
functions required are characterized by L, S quantum numbers. This is 
explained below. 

The Hamiltonian (1 1 .2) unlike 3^C is not invariant under a rotation 
of the spatial co-ordinates of any one particle taken alone and so does 
not commute with the individual electron angular momentum operators. 
The //, 1 = 1 , 2, . . ., n are not good quantum numbers. However this 
Hamiltonian is invariant under a simultaneous rotation of all the spatial 
(or spin) co-ordinates and so commutes with the system angular (and 
spin) momentum operators. The energy of the system depends upon the 
system momentum quantum numbers L and S. The Hamiltonian (1 1.2) is 
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spin-independent and it may appear that the energy should only depend 
upon L. In fact the energy does also depend upon S, and this is an 
exchange effect. (See discussion of the Helium atom (1 1 .3).) 

The determinantal wave-functions belonging to a given configuration 
form the basis for a reducible representation of the symmetry group of 
the Hamiltonian (1 1 .2). 

Repeated use of the vector coupling rule (1 1 .5) or (1 1 .6) may be used 
to give the possible irreducible representations and hence the allowed 
values of L and 5 that are included. Once more, application of the Pauli 
principle may exclude some of these values of L and S. 

By taking suitable combinations of the determinantal wave-functions 
belonging to a configuration it is possible to construct sets of 

(2L + 1)(25 + 1) (11.36) 

functions for each allowed pair of values (L, S) which transform according 
to D( L ) and D^ under spatial and spin rotations respectively. Each set 
of wave-functions belongs to the same energy level, called a spectral 
term, of the Hamiltonian (1 1 .2) and the degeneracy of the term defined 
by (L, 5) is given by (1 1 .36). 

The wave-functions for a given term {L, S) do not consist entirely of 
linear combinations of the wave-functions from the corresponding 
configuration. They include small contributions from terms, with the 
same values of L and 5, belonging to different configurations. This effect 
is called configuration interaction. The energy difference between terms 
associated with a given configuration is small compared with the energy 
difference between configurations. 

As an illustration consider the following examples. The helium atom 
ground state has the configuration with both electrons in the (Is) state. 

(Is) 2 . (11.37) 

These electrons have the same values of n and / and are said to be 
equivalent (1 1 .34) gives this configuration a maximum possible degeneracy 
of 2 2 = 4. However this is clearly incorrect as the Pauli principle requires 
that the two electrons have opposite spins and this configuration is not 
degenerate. The allowed values of L are given by the coupling rule 

D (o) xD (o) =D (o) 

i.e. 

L = 0. (11.38) 

The allowed values of S are given by 

z) (l/2) x£) (l/2) =£) (l) +Z) (0) 



S = 0, 1. (11.39) 
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So it would appear that the eigenfunction (1 1 .37) includes the terms 

l S, 3 S. 

This is incorrect since the Pauli principle does not permit the term 3 S 
and the configuration corresponds to the single term 1 S. 
The first excited state of helium has the configuration 

(ls)(2s) 

(1 1 .34) does give the degeneracy correctly as 2 2 = 4 in this case. The 
allowed values of L and S axe given by (1 1 .38) and (1 1 .39) and both the 
terms l S and 3 5 are included in the configuration. 

The next excited state of helium has the configuration 

(ls)(2p) 

and (1 1.34) gives the correct degeneracy 2 2 x 1 x 3 = 12. The allowed 
values of L are given by 

D (o) xD (i) =D (i) 

i.e. 

L = l 

and S = 0, 1 as before. The allowed terms in the configuration are 1 P and 
3 P. These terms have degeneracy three and nine respectively making 
twelve wave-functions in all as required. 

These last two configurations each have different terms with the same 
value of L but different values of 5. Even though the Hamiltonian (1 1 .2) 
does not depend upon the spin co-ordinates these terms do have different 
energies. This is an exchange effect and arises because determinantal 
wave-functions require antisymmetry in all the co-ordinates including 
spin. 

In the previous discussion of the helium atom it was explained that 
the term, arising from the excited configuration (ls)(2s), in which the 
electrons have parallel spins (5=1) has a vanishing probability for the 
electrons to be in the same point in space at the same instant. This 
reduces the positive contribution to the energy arising from the electron- 
electron interaction. This result can be generalized to give Hund's rule. 
The term with the lowest energy obtained from a given configuration is 
that term with the largest value of S and the largest value of L associated 
with that S. 

Fine structure 

The Hamiltonian (1 1 .2) neglects relativistic effects and excludes all 
terms that depend upon the spin co-ordinates. The coupling between the 
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spin and the orbital motion of each electron adds a contribution, 

2c m *-> r t dr.- 
i = i 

to the Hamiltonian where & ix and s ix are the ^-components of the 
momenta of the ith-particle. In addition there are interactions between 
the spin of one electron and the orbital motion of the others and also 
interactions between different spins. 

These terms are of the order of (v/c) 2 and produce the fine structure 
of term energy levels. All spin interaction terms (e.g. (1 1 .40)) involve 
first or second powers of inner products of the vector operators (JS?/„ 
Jiffy, SCf Z ) and (s ix , s iy , s lz ). These inner products are not invariant 
under a separate rotation of the spatial or spin co-ordinates of all the 
electrons but they, and hence the complete Hamiltonian are invariant 
under a simultaneous rotation of all the co-ordinates both spatial and 
spin. That is, the complete Hamiltonian does not commute with the 
operators representing the system angular momentum or the spin angular 
momentum but does commute with the total angular momentum 
operators. L and S are no longer good quantum numbers but / and Mj 
are good quantum numbers. The energy depends only upon/ and so each 
energy level has degeneracy (2/ + 1). 

For the lighter atoms the spin terms are smaller than the electrostatic 
terms and the spin terms can be regarded as a perturbation of the spectral 
term energy level. The (2jL + 1)(2S + 1) degenerate wave-functions 
belonging to a given spectral term (L, S) span a function space that is 
reducible under the new (reduced) symmetry group of the Hamiltonian 
and so this degeneracy will be split by the spin perturbation to give the 
spectral term 'fine structure'. This function space can be separated into 
sub-spaces each of which transform irreducibly under the simultaneous 
orbital-spin rotation group according to D^ where / is given by the 
vector coupling rule (1 1 .6). 

/ = (L +S),(L + S- 1), . . ., \L - S\. (11-41) 

Each of these values of/ corresponds to a separate energy level. The 
values of the fine-structure splittings can be obtained from (1 1 .40) by 
calculating the appropriate matrix elements. 

This approximate method for constructing the wave-functions for an 
atomic energy level from a given term is based on the assumption that 
the 'fine structure' energy differences are small compared with the 
differences between terms. This is often the case and is referred to as 
Russell-Saunders or LS coupling. 

The exact wave-functions for an atomic level with angular momentum 
quantum numbers (/, Mj) will also contain small contributions from 
other terms. 
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For heavier atoms the relativistic spin terms are not small compared 
with the non-central field electrostatic terms and cannot be treated as 
perturbation in this way. A better approximation to the atomic wave- 
functions is obtained by characterizing each electron by its total angular 
momentum/. The non-central field electrostatic terms are then used as a 
perturbation to construct wave-functions with total angular momentum 
/. This scheme is called //'-coupling. Actually this coupling method is 
never completely valid although it may apply for highly excited states. 
It is the opposed limiting case to the LS scheme and the true coupling 
scheme for heavy atoms may lie somewhere in between. 

In addition to the fine-structure splitting due to the electron spin 
there is also a very small splitting of energy levels arising from the weak 
interaction between the spin of the nucleus and the electrons. The 
nuclear spin and the total angular momentum of the electrons can be 
coupled together. This produces the hyperfine structure of the levels and 
the splitting is about 10 3 of the fine structure splitting. 



Selection rules 

It can be shown that when the radiation wave-length is large compared 
with atomic dimensions and the electron-dipole approximation is valid, 
the spontaneous and induced transition probabilities are proportional to 
the square of the modulus of the dipole integral. 



MM 



\jjjdr 



(11.42) 



\jjj and \jjf are the initial and final many-electron states, r k is the position 
of the kth electron and the integration is over all electron co-ordinates. 
(Compare (8.38) for a single electron.) 

£ r fc changes sign under an inversion and since the atomic wave- 
functions defined by (J, Mj) have either positive or negative parity the 
dipole integral (1 1 .42) will vanish unless the transition involves a change 
in parity of the atomic wave-functions. This selection rule is called 
Laporte's rule. 

There are other selection rules and group theory will now be used to 
determine the non-zero transition probabilities. 

Suppose the function (p is one of the basis functions for an irreducible 
representation of a symmetry group. It can be shown that the integral 
over all co-ordinates 

J0dr = O (11.43) 

if the representation is not the unit representation (see Landau and 
Lifshitz, p. 343). As examples consider the hydrogen type one-electron 
functions. The functions which transform according to the irreducible 
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representation D^ of the rotation group are \pp x , \pp y and \frp z and 
clearly 



j\l/ Px dr = \typy dr = j $p z dr = 0. 



However the function which transforms according to the unit 
representation Z)(°) is \p s and 



J** 



dr#0. 



Now the three components of 2£_ t r fc transform amongst themselves 
as a vector under simultaneous rotation of all the electron co-ordinates. 
That is, they transform according to the three-dimensional irreducible 
representation D^\ 

The initial state wave-function for the atom i//,- must transform 
according to some irreducible representation D^fi and so the set of all 
functions (remember \j/ t is 2/,- + 1 fold degenerate) 

l t k *t (11-44) 

fc = i 

transform according to the sum of irreducible representations 

= D <- J t+» + £> (/ <> +D (J '~ 1) J t > 1 
£>(D X D Vti =Z )(3/2) +Z )U/2) /| = J 

= £>< 1 > /,=0. 

Of course the final state of the atom must also transform according to 
some irreducible representation DW and so the integrand of (1 1 .42) 
transforms according to 

£)U/) X £)(1) X £,(//) (11.45) 

This triple product will contain the unit representation only if £)W is 
contained in the product D^ x Z)W. That is, if 



J f = J i + \,J i ,J i -\ 


Jj>l 


~u 


Ji=\ 


= 1 


J, = Q 



The dipole integral (1 1 .42) may be different from zero only for pairs of 
states such that 

5/=+l,-l 

8J = (11.46) 
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except the transition /,• = to // = 0. These are the selection rules for 
allowed transitions. It can be shown that 8Mj = +1,0,-1. 

For lighter atoms in the Russell-Saunders approximation, the spatial 
part of an atomic level wave-function transforms according to D^ under 
space co-ordinate rotations and the spin part transforms according to D^ 
under spin rotations. Since 2jt = 1 r k transforms according to£)( a ) andZ)(° 
under space and spin rotations respectively, then it can easily be shown 
that the selection rules for transitions between Russell-Saunders states are 

5X=+1,-1 

61=0 
except the transition 

L t = to L f = 
and 

55 = 0. (11-47) 

Observe that the total spin is conserved. 

The sodium D-lines 

In the ground state the sodium atom in the self-consistent field approxi- 
mation has eleven electrons in the configuration 

(l5) 2 (2 S ) 2 (2p) 6 (3s). 

The Pauli principle drastically reduces the possible 'degeneracy' of this 
configuration. The inner closed core often electrons transforms according 
to Z)(°) in both orbital and spin space and the only degeneracy allowed 
occurs from the possible values for the spin quantum number s for the 
outer (3s) electron. The ground state configuration is doubly degenerate 
and is represented by the doublet term L=0,S = $, i.e. 2 S. The spin- 
orbit coupling does, not reduce this degeneracy and the atomic level may 
be written 



>l/2 



L=0, 5=i J = l 



The subscript gives the /-value. 

The sodium atom may be excited into a higher energy perhaps by 
applying a suitable field or by heating it in an electric arc. The lowest 
excited configuration is 

(ls) 2 (2s) 2 (2p) 6 (3p). 

There are six such 'degenerate' states corresponding to the three 
different values of / and the two different values of s taken by the outer 



SYSTEMS OF PARTICLES 293 

(3p) electron. For the complete configuration L = 1 , S = \ and so it 
contains the single term 2 P. The allowed values of the total angular 
momentum of the system are 

i.e. 

J = 11 

Consequently, the term 2 /* splits up into the separate energy levels 

■^3/2. 2 ^i/2 
under the spin -orbit interaction. 



(li)W(2j>)«(3») 

r? 



Figure 11.3 The sodium D4ine transition. 

The downward-transition from either of the levels 2 P 3 / 2 or 2 P 1/2 to 
the level 2 Si/ 2 is possible as 

6Z = - 1 , 55 = 0. 

These two transitions are the cause of the D-line doublet in the sodium 
spectrum and correspond to wave-lengths 5.889-96 A and 5895-93 A 
respectively. 



11.5 Second Quantization 

The method of second quantization is very useful for studying systems 
of identical particles. It is necessary in the quantization of fields where 
the number of particles may actually change. For example, photons may 
be created or annihilated in a radiation field. However this work is out- 
side the scope of an introductory text and the second quantization 
scheme will only be applied to electron systems below. 
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Let {0g,-(x)} be a complete set (infinite) of orthonormal one-particle 
functions, x denotes the set of four co-ordinates including the spin- 
orientation of the particle. This set may be combined to form anti- 
symmetric (or symmetric) n-particle wave-functions. For electrons 
(i.e. Fermi particles the w-particle wave-function is anti-symmetry. Let 
& n \xi, x 2 , . . ., x„) be such an anti-symmetric n-particle function 
constructed from the first n functions in the complete set. Then 



<E»W(x„x 2 ,...,x n ) = 



v^T 



0<2 1 (xi)...0q 1 (x„) 



0g„(xi) • - • 0<2„(x„) 



(11.48) 



Similarly, other n- particle determinantal wave-functions can be con- 
structed using other sets of n single-particle functions. In this way a 
complete set of determinantal wave-functions is obtained and an arbitrary 
H-particle wave-functions can be expanded in terms of these n x n 
determinantal wave-functions. 

Each of the determinantal wave-functions is completely specified 
when the number of particles in each single-particle state is specified. 
Let «,- be the number of particles in the state <j>Qj(x). The wave-function 
(1 1 .48) corresponds to 



«1 = «2 



= «» = 1, 



'n + l 



'n + 2 



= ... = 0. 



(11.49) 



The 'occupation numbers' «,• can only take the values or 1 for Fermi 
particles (Pauli principle). This restriction does not apply to particles 
obeying Bose-Einstein statistics. 

The w-particle determinantal wave-functions will now be regarded as 
functions of the occupation numbers of the states 0g,(x) rather than as 
functions of the particle co-ordinates. With this in mind the wave-functions 
will be written 



#("l,«2.«3, • • •) 



(11.50) 



where 



2 ",• = n 

i = l 



(11.51) 



and for Fermi particles n t = 0, 1 . 

The wave-function (1 1 .48) is written 

*(1, 1,1, ...,1,0,0,...). 
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In this new scheme the operators representing dynamical variables 
must act on the occupation numbers rather than on the particle co- 
ordinates. At this point it is necessary to introduce creation and annihil- 
ation operators. The creation operator af operates on a n-electron wave- 
function such as (1 1 .50) to add a new particle in the state 0q,-(x), i.e. 

fl ( + *(/ii,n 2 , • • ••«!, • • •) = (-l)*'0 - w,M"i,"2, ...,«, + 1, . . .) 

(11.52) 

where 

s,= 2 n k , (11.53) 

fc = i 

s t is equal to the number of occupied states up to but excluding 0g,(x). 
(— 1/« is a phase factor, the significance of which will become apparent 
later. Observe that if n t = l, the right hand side of (1 1 .52) is zero in 
agreement with the Pauli principle. A determinantal wave-function with 
two particles in the same state is identically zero. 

An annihilation operator a,- is defined to remove a particle from the 
state e .(x) 

«,$(«!, « 2> . • .,«,,• . •) = (-l) r, 'n/*(Mi,«2» •• •»«/- !>•• •) 

(11.54) 

(af is the Hermitian adjoint of a,). Similar but different expressions 
define the action of the creation and annihilation operators if the 
particles are bosons. 

As n t can only equal one or zero for Fermi particles then from (1 1 .52) 

atat = (11.55) 

and from (1 1 .54) 

tt t a t = 0. (11.56) 

The vacuum state is defined to be that state containing no particles. 
Hence 

«l*vac=« ) («l = l,"2 =0, ...) = Qi (x) 

represents the creation of a single particle in the state 0^ (x). A general 
determinantal wave-function can be constructed from the vacuum state. 

*(n„n 2 , . . .,/,,, . . .) = (aj)"'(4)" 2 • ■ • («*)"' • • • *vac- (H-57) 
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Changing the order of the creation operators in a product corresponds 
to interchanging the particles. When two Fermi particles are interchanged 
the wave-function changes sign. Hence it is necessary that 

(^;>$ = -(«;«;)* 01.58) 

for arbitrary $, i.e. 

[at,a-~] + =0 (11.59) 

where [a, |3] + = a|3 + |3a is the anticommunicator of a and 0. It is to 
satisfy this requirement that the phases are chosen as in (1 1 .52) and 
(1 1 .54). To see this consider 

aUt^iriu •..,«, = 0, . . ., n f = 0, . . .) 

= (- 1) V*(«i , ...,«! = 0, .... n, = 1 ... .) 

= (_l)»/(-l)*i*(n,, . . ..a, = 1, . . .,«, = 1, . . .) />/. 

(11.60) 



Also 



aJatHni, ...,", = 0, . . ., n f = 0, . . .) 
= (-l)*'fly + *(« 1 ,...,n / = l,...,/i / = 0) 
= (-l) s '(-l)*/*(n„ . . .,«,= 1, . . .,«, = 1, . . .) j>i. 

(11.61) 

Since s y ' = s, + 1 then (1 1 .58) is indeed satisfied . 

The annihilation operators satisfy a relationship similar to (1 1.59) 

[«,,«,]+ =0. (11.62) 

There are also commutation relations which involve both creation 
and annihilation operators. Consider the action of the operator a *a ( on 
the determinantal wave-function <&(n lt . . ., «,-, . . .) 

«/*(«!, . . .,«,-, . . .) = (- 1 Y'n, *(«!,. . .,«,- - 1, . . .) 
afaQ(n lt . . .,/!,, . . .) =(-l)*'«i(-l)*'*(»i, . . .,«,, . . .) 
«r«/*(«i » • • •> «,-, • • •) = ",<K" l, • • •> ".', • • •) 
as «,- = 0, 1 for Fermi particle^. The operator 

Ni=aUi (11.63) 

is the number operator for the state 0g,(x). Its eigenvalues are the 
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number of particles in this state. The total number operator is defined 
to be 

N= 2 N t . (11.64) 

1=1 

Consider now the action of the operator a t a* 

«,**(«„ . . .,n„ . . .) = (-1)*'(1 - «/)*(#!!, ...,«, + 1, . . .) 

:. B|fli + *(n„ . . ., «,-, . . .) = (-1)*'(1 -»,)(- 1 )*'(»/ + 1) 

X *(«!,... ,«,-,...) 

i.e. 

a t at^{n u . . ., n h . . .) = (1 - n/)*(« 1; . . ., n,, . . .). (1 1 .65) 

From (11. 62) and (11. 65) 

(afa +a i at)$(n 1 , . . .,«,-, . . .) = *(«! n t , . . .). 

This is a special case of the commutation rule 

a/a,- +a/fl*=l 
or generally 

[at fl .] + = 6// . (11.66) 

From the definition (1 1 .63) and the commutation rules (1 1 .59), 
(11 .62) and (1 1 .66), it is quite easy to show that the number of operators 
belonging to different states commute, i.e. 

pV f ,ty]so. (H-67) 

Consequently functions can be found that are simultaneously eigen- 
functions of all these number operators. These are of course the 
determinantal wave-functions. 
Finally it can be shown that 

Wi,a,\s-6 ti at 1 - 68 ) 

and 

[N h a}]=8 if at. (11.69) 

Representation of observables 

A physical observable is represented in quantum mechanics by an 
operator. Let a(x)be the 'co-ordinate' operator that represents the 
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observable A. As the set {0gj(x)} is complete the effect of a(x) on the 
single particle function 0g,(x) is 

<*0G,-(x)= 2 a/,-0 2/ (x) (11.70) 

7=1 

with 

a ;i = )<l>Qj<X<t>Q i dT x . 

(The integration includes a sum over spin co-ordinates.) The right hand 
side of (1 1 .70) can be written in terms of creation and annihilation 
operators. 

oo / oo oo \ 

2 a^a^Q^x) =22 afo k a k e ,(x) 
/ = 1 \j = 1 k = 1 / 

/ oo oo \ 

= 22 afa jk a k ){at^ vac ). (11.71) 

From (11.70) and (11.71) it is seen that the operator a(x) which acts on 
functions of the co-ordinates x can be expressed as an operator acting on 
the occupation numbers «,. 

«(x) = 22a/ , a/ta t . (11.72) 

This result has been shown for the simplest case of a one-electron 
function but it can be extended. 

Let a(x lf x 2 , . . ., x„) be an additive operator for a system of n 
particles such that 

a(x 1 ,x 2 , ...,x„) = a(x 1 )+a(x 2 ) + .. . +a(x„). (11.73) 

This operator is symmetrical with respect to all the particles and it can 
be shown that 

a(x 1 ,x 2> . . .,x„) = 22a-'a lc a lc (11.74) 

witha /k = f <!>QjOc(x)^Q k dT x as before. 

As an example consider a system of n identical particles whose mutual 
interaction can be ignored. The system Hamiltonian is the additive 
operator 



ie= 2 *r (n.75) 

r =1 



with 



ffS ~hi V ' 2 + F(x ' ) - 
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The simplest non-trivial case is a system of two particles. The wave- 
function for the system can be expressed as a linear sum of 2-particle 
determinantal wave-functions of the form 

where the phase of the determinant has been sensibly chosen. 
The effect of the system Hamiltonian on this determinant is 

[Jfi + ^l-/=- [0G,(xi)0e m (x 2 )-0Q m (xi)0e/(x 2 )j 



1 



0Qm( x 2) 2 /*/70Q/(Xi) - 0(2/ (X 2 ) 2 ^/m0(2/(Xl) 



+ 0G/( x i) 2 ^/m0e/(x 2 ) - 0G m (xi) 2 ^/70G,(x 2 ) (1 1 -76) 



as 



Jt r 4>Qi(x r ) = 2 ^,70<2/(Xr) for r = 1 , 2. 
The right hand side of (1 1 .76) is 

2 6,1 —E [0Qm( x 2)0e/(Xl) - 0(2m(Xl)0<2/(X2)] 

-2^/m -^ [-0(2/(xi)0(2/(x2) + 0(2/(x2)0e/(xi)]. 

In terms of the creation and annihilation operators this becomes 
2 [^//fl/am^vac - */m«/V*vac] - ! - 77 ) 

It is quite easy to show that (1 1 .77) is equal to 
2 2 df A ik a k [fl/ + am$vac] 

and hence deduce that 

i k 

as required. For a system of n non-interacting particles, the Hamiltonian 
in the occupation number representation is 



2.2.a/'4 lk a k . 



(11.78) 
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Suppose the expansion set {4>Q t (x)} is chosen to be the set of eigen- 
functions of the one-electron Hamiltonians so that 

and 

*lk=E Qi b lk . (11.79) 

The Hamiltonian (1 1 .78) becomes 

IEq^o,- = 2E Qi N f (1 1 .80) 

and its eigenfunctions are also the simultaneous eigenfunctions of the 
number operator Nj. The eigenvalues are 

2Eq, rij rij = 0,1 and 2 «/ -n. (1 1 .81 ) 

Unfortunately the operators in many particle systems are not always 
additive in the sense of (1 1 .73). The particles often interact with each 
other and co-ordinate operators occur which can be written 

0(x lt x 2 , . . ., x„) = 2 ! 2/3(x r , x s ), (1 1 .82) 

r > s 

where 0(x r , x s ) operates on functions involving x r or x s . It can be shown 
that 

P(x„ x 2 , . . ., x„) = \ 2 1,22pjl,ata?a,at, (1 1 .83) 

i j k I 

with 

For example, the Coulomb interactions between electrons adds a term 

e 2 



12>(r r -i,) where V(t r - t s ) = 

r>s 4ne 1 t r - r s \ 

to the w-particle Hamiltonian (1 1 .75). i r is the spatial part of x r . The 
Hamiltonian in the occupation number representation is then 

2 2 af 4, k a k + 1 2 1 2 2 «,V V'l a,a k (1 1 .84) 

j k i j k I 

where h /k is defined as before and 
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Electron field operators 

The rules for finding the form of the operators, representing observables, 
in the occupation number representation can be stated in terms of new 
'electron field' operators * + (x) and *(x) defined by 

* + (x) s 2*5,(xW 9(x) = 2^Q t 00a,. (11.85) 

i i 

The operator ^(xo) creates a particle with the co-ordinates 
(including spin) x . 

It is quite easy to show that these operators satisfy the commutation 
relations 

[*(x),*(x')] + =0 

[*(x), * + (x')] + = 6 (x - x) (1 1 .86) 

where 

5(x - x') = 8(x - x')8 (y - y')6(z - z')5 CTz <^. 

These operators are of interest because the equations (1 1 .74) and 
(1 1 .83) can be written 

a(x,, x 2 , . . ., x„) s jV(x)a(x)*(x) dr x (1 1.87) 

0(xi, x 2 , • • ., x„) = i fj * + (x)* + (x')0(x, x')*(xXx) dT x dr' x 

(11.88) 

respectively. The reader is asked to confirm that the right hand sides of 
(1 1 .87) and (1 1 .88) agree with the right hand sides of (1 1 .74) and 
(1 1 .83) by substituting (1 1 .85) for the electron field operators. 

The electron field operators play a similar role to that of the wave- 
function in the Schrodinger theory. In the second quantization formalism 
wave-functions are in some sense replaced by operators. 

Bosons 

The work above has been specifically oriented towards Fermi particles. 
There are differences in the formalism when applied to bosons. The n- 
particle wave-functions 4>(n ls n 2 , . . ■) are now symmetric in interchanging 
any two particles and the occupation numbers can exceed unity. 
Creation and annihilation operators are defined by 



a?$>(n u n 2 ,. . .,n h . . .) = y/n t + 1 &(n 1 ,n 2 ,.. .,«, + 1,..-.) 
a i $(n 1 ,n 2 ,...,n i ,. . .)= Vn7$(n 1 ,n 2 , . ..,«,-!,...) 
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respectively. The commutation relations are also different, e.g. I 

[a!,at}=0. (11.89) 

This development is outside the scope of this text although it is 
certainly important in many problems, such as in a discussion of photons 
in an electromagnetic field or in the theory of phonons in lattice 
vibrations. 

The interested reader is referred to Introductory Quantum Electro- 
dynamics, by Power, and to Concepts in Solids, by Anderson. The latter 
text has an interesting derivation of the Hartree-Fock equations in second 
quantization formalism (p. 15). 

PROBLEMS 

1 Use the vector coupling rule to show that 

(a) dWxDW=dW+D< 5 M 

(b) £<» xD w xZ) (1 > =Z) (3) + 2Z> (2 > +3£> (1) +£> (0) 

(c) DW> x D^ x D (V2) = D (3/2) + 2D (i/2) 

2 Show that the inversion operator is Hermitian. 

3 The electric dipole moment of a system of n particles each of charge 
e can be written 

n 

P = e 2 «* 
k = \ 

where t k is the position vector of the kth particle. Explain why P 
changes sign under the action of the inversion operator. 

Deduce that if the state function i//(ri, r 2 , . . ., r„) has either even or 
odd parity then the expectation value of the dipole moment is zero, i.e. 



Ji//*p^ di"! . . . dr n = 0. 



This show that in the dipole approximation transitions are allowed only 
between even and odd states. 

4 Consider a tightly bound collection of particles which behaves as a 
single particle at a point r with S = 1 . The wave-functions may be written 
as a column matrix 



Hr,t) = 



and m s can take the values +1,0,-1. 
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Verify that the appropriate commutation rules are satisfied by the 
matrix representation 





S 2 =fi 



What are the eigenvectors corresponding to m s = 1 , 0, - 1 respectively? 
Show that the matrix representing So P is equal to 2ft 2 times the 3 x 3 
unit matrix as required. 

5 Consider a system composed of three electrons. If the spin interaction 
terms are ignored, the system Hamiltonian is invariant under a separate 
rotation of each spin co-ordinate and the spin momentum of each electron 
is conserved. For each electron s = \ . Using part (c) of problem (1 ), or 
otherwise, deduce that there are a quartet of system states with S = § and 
M s =\,\,—\,—\ and two separate doublets with S = $ and M s = \,-\. 

Obtain the values of 5 and M s for each of the following spin functions 
and confirm that the quartet is 

a(o lz )a(o 2z )oi(o 3z ) 

— [a(a lz )a(a 2z )(3(a 3z ) + a(a lz )/J(ff 2 z)«(o'3z) 
V3 

+ j3(a lz )a(a 22 )a(a3 Z )] 

"/=• W(oizM°2zMo 3z ) + j3(a lz )a(0 2 *)/Ko3 Z ) 
+ a(0i z )/3(o 2z )j8(a 3z )] 

P(°lzM°2zM°3z) 

and that the two doublets may be written 

-J= [<*(°lz)<x(°2z)fc°3z) + a(Olz)0(°2z)<*(O3z) 

- 2p(o l2 )a(o 2z )a(o 3z )] 

"y| lP(OlzM°2zM°3z) +P(°lzM°2zM°3z) 

-2a(o lz )0(a 2z )0(o 3z )] 
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and 

— p [a(o lz )a(o 2z )P(o 32 ) -a(<r lz )0(a 2z )a(a 3z )] 

~E [/3(oi z )/3(a 2z )a(a 3z ) - 0(a lz )a 2z )/3(a 32 )] . 

Observe that the first doublet is symmetric in the interchange of the 
second and third particles whereas the second doublet is antisymmetric 
in these particles. 

6 Consider a linear box of unit length bounded by walls of infinite 
potential. If the origin is taken at one end of the box confirm that the 
wave-functions for an electron inside the box is 

i//„(x) = \/l sin (nnx) n = 1 , 2, 

Write down the spatial part of the determinantal wave-function i//(jci, x 2 ) 
describing two electrons, with identical spin, in the two lowest energy 
states. If one electron is at the point x = 3, the probability distribution 
of the other is 

**CUa)lKi,*2). 
Show that this is equal to 



2 



sin itx 2 — sin 2nx 2 

\/2 



Sketch this function and explain how it illustrates 'spin correlation'. 
Compare this with the distribution obtained from a simple product 
wave-function. 



7 Show that the expectation value of the Hamiltonian (1 1 .2) obtained 
using the Hartree product (1 1 .24) is 

2 e k -2 2 J* f 1 1®, (r,) | 2 l<}> Qk (r k ) | 2 , , dr, dr fc . 

This shows that the energy of the system is less than the sum of the single 
particle energies. 
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8 In the approximation that the nuclei are fixed (Born-Oppenheimer) 
the Hamiltonian for the hydrogen molecule is, ignoring spin terms, 



2m 



2m 47re \r la r lb r 2a r 2b ) 



e 2 



+ +■ 



4ne r 12 47re # 

where r Xa , r lb are the distances of the first electron from the protons 
which are at V and V and similarly r^, r 2b are the distances of the 
second electron from the protons. The separation of the electrons is r 12 
and the protons are a distance R apart. 

Let fl (r), 4> b (t) be the real normalized hydrogen (Is) wave-functions 
centred on the protons at V and V respectively. A first approximation 
to the ground state wave-function for the molecule will be obtained by 
considering the four normalized determinantal wave-functions of the type 



<Kri,o l2 ;r 2 ,a 2z ) = 



0aOl)aOlz) <t>a(j 2 )a(02z) 

6 (ri)a(a lz ) 0&(r 2 )a(a 2z ) 



V2(l-A 2 ) 
where 

A = \<Pa(r)<t> b (r)dT. 

Show that from these it is possible to construct the singlet state with 
symmetric spatial part 

*»('i. 'a) — /„„ A2 x [tfaO-i^fo) + * fl (r 2 )*&(ri)] 
V2(l + A 2 ) 

and a triplet state with antisymmetric spatial part 

4>A(ri, r 2 ) = A/1 A , x Wfa('i)fe('2) - a (r2)0ft(ri)]- 
V2(l - A 2 ) 

Show that the energies of the system corresponding to the singlet and 
triplet states are respectively 

/ K 

E s = 2E(ls) + -r + -s 

s l +A 2 l +A 2 

/ K 

E A = 2£(ls) + n -j 

-* v ' 1-A 2 1-A 2 
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where E(ls) is the energy of the hydrogen (Is) state and 



J = trl\*°^ 



47re, 
xdTi dr 2 , 



(r 2 ) 



1 



1 1 1 

— + — + 



r \b r 2a 



r 12 



R 



K ~-£;\\^^ 



r lb r 2a r \2 & 



a (ri)0&(r2) 



0a(>"2)^(ri) 



XdTi dr 2 

Show that 

,2 /• ,2 



2e 2 ffeVi) . + e 2 ff a 2 (r 1 )0|(r 2 ) 

/ = - drj + dr i dr 2 

4ne J r lb 4ne J J r n 



4tie R 



and that 



K = 






2£ 

47re, 
xdr : dr 2 + 



r 16 47T6, 

e 2 A 2 



JP^ 



)0&(r2)06(ri)0 a (r 2 ) 



'12 



47re i? 



The first term in AT is dominant and so K < 0. Explain why this means 
that the singlet state lies deeper than the triplet state. (Compare this 
result with that obtained for the helium atom.) 

9 Consider two particles each of mass m at positions rj, r 2 respectively 
which interact with one another through a potential K(|r! — r 2 |), which 
depends only upon their separation. The time-independent Schrodinger 
equation for the system is 



1m 



(V 2 +V 2 )£2+ K(|rj -r 2 \)Sl=E£l 



where SI(ti, r 2 ) is the spatial part of the state function. Following the 
argument from equations (4.45) to (4.51) show that the relative motion 
of the particles is described by the equation 



3 2 i// 



1 



r 2 dr \ drj r 2 sin 2 6 deb 2 r 2 sin 6 30 



3 / . .30 
39 



2 M 



(£■,-^=0 
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where /! = m/2 and r, 9, <j> are the polar co-ordinates of the second particle 
relative to the first. (Note r=\r 1 - r 2 1 .) E s is the 'internal' energy of 
the system. 
Show that if 

lKr.0, 0)=/i(r)e(0)*(*) 

it is necessary that 



d 2 $ 

d<j> 2 



= -m 2 <I> 



1 d / „d©\ m 2 



sin 6 — - - —^5- + X© = 



sin d0 I d0 / sin 2 



r 2 dr r drfr 2 S 2 



jzKtt-^+itP.-^- 



where m, X are separation constants. 

Following the argument leading to equation (4.59) show that for a 
physically acceptable solution 



1 d / , dR . 
V — + 



*p +%«.-*> 



r 2 dr\ dr 
with 

l>\m\ 
and m is an integer. 

10 In question 9 assume the 'Hooke's law' potential 
V(r) = ^(r-r ) 2 + V 

where V , k, r are constants. In equation (i) substitute 

S(r) 
R(r) = ~ ±J and P=r-r 
r 

and hence show that for those states with 1 = 

-£0 ♦!'*-«■ -™ 

Deduce that the allowed vibrational energies are 
E s = V +{n + h)h i F- n = 0,1,2,.... 



R = (i) 
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These energy levels are an approximation to the vibrational energy 
levels of a diatomic molecule. (The rotational levels are given by (4.60).) 
A better approximation would be obtained by assuming the Morse 
potential 

V(r) = V [e-^ r - r " )la - 2e- (r - r ° )/a ] 

where Vq, r , a are constants. 

1 1 The spin functions a(a z ) and j3(a z ) may be both chosen to have 
even parity p = +1 (see Heine's book). Deduce that a configuration^ 
wave-function (including spin) has the parity (— l) £ft , where /,• is the 
angular momentum of the rth particle. The complete atomic Hamiltonian 
including spin terms is invariant under the inversion operator. Use 
perturbation theory to deduce that configuration interaction only takes 
place between configurations with the same parity. 

12 Show that following configurations include only those terms given. 

(a) (2s) 2 >S 

(b) (2p) 3 4 S, 2 D, 2 P 

(c) (3d) 2 3 F, 3 P, l G, i D, l S 

1 3 Explain why the eigenfunctions belonging to the term (L, S) can be 
characterized by the four quantum numbers L, S, J, Mj. 

The spin-orbit coupling term in the many-electron Hamiltonian can 
be written (see Heine's book). 

A(L, S)[& x s x + £ y sy + Se z s z ) 

where £P X , s x etc. are the system momentum operators. Deduce that the 
energy of the wave-function defined by L, S, J, M, is 

tfo + \A{L, S)[J(J +l)-L(L + l)-S(S+l)]h 2 

whereto is the term energy ignoring the spin-orbit coupling. (Hint: 
Compare the fine-structure theory leading to equation (10.31).) 

Show that the energy difference between fine structure components 
with 5/ = 1 is equal to A{L, S)Jh 2 . This is Lande's interval rule. 

1 4 Confirm the commutation rules (1 1 .67), (1 1 .68) and (1 1 .69). 

1 5 Use the closure relation 

2 0e / (x)0e I .(x o ) = S(x - x ) 
i 

to show that the field operator ^(xq) creates a particle with defined 
values of the co-ordinates (including spin) x . 
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1 6 Show that the right hand sides of (1 1 .87) and (1 1 .88) agree with 
the right hand sides of (1 1 .74) and (1 1 .83) respectively. 

17 Obtain the commutation rule (1 1 .89) for Bosons. 
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CHAPTER 12 



The Dirac Equation 



12.1 Introduction 

The quantum mechanics described in the previous chapters is essentially 
non-relativistic and has been obtained by quantizing the appropriate 
classical expressions. It is necessary t^ develop the quantum theory to 
obtain a description that is valid for particles with speeds approaching 
that of light. The present chapter deals only with the relativistic theory 
for a single particle. A satisfactory many-particle theory requires quantum 
field operators and is beyond the scope of this book. 

A useful feature of the relativistic equations is that particle spin is 
built into the theory and does not have to be added in the way that 
Pauli added the spin concept to Schrodinger's theory. 



12.2 The Lorentz Transformation 

According to the principle of special relativity, the velocity of light as 
measured by an observer is independent of the motion of the observer. 
Consider two inertial co-ordinate frames of reference S and 5" such that 
S' moves relatively to S at the constant speed v along the positive x-axis. 
The x'-axis coincides with the x-axis. (x, y, z, t) and (x',y',z',t) are the 
space and time co-ordinates referred to the reference frames S and S' 
respectively. 

Suppose that at the instant / = 0a spherical electromagnetic wave 
leaves the origin of S which at that moment coincides with the origin of 
S'. The speed of propagation of the wave front is isotropic and equal to 
the same constant value c in both reference frames and so 

x i +y i + z *-c 2 t 2 =0 = x" L +y' 1 +z' 2 - cV 2 (12.1) 
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Figure 12.1 



The transformation from (x, y, z, t) to (x', y', z', t') that satisfies 
(12.1) is 



x' = (i(x-vt), y'-y, z' = z, t' = $\t 



-?*) 



(12.2) 



with 



Such a transformation is called a Lorentz transformation. Strictly this is 
a proper Lorentz transformation as it does not involve an inversion in 
space or time. When v < c, (12.2) reduces to the classical Galilean 
transformation 



x = x - vt, y =y, z = z, t =t. 
The transformation (12.2) can be inverted to give 



x = ${x + vt'), y=y',z- 



i 
z, 



t~p 



H' 



(12.3) 



(12.4) 



A rotation of axis in three-dimensional space leaves the quantity 
x 2 +y 2 +z 2 invariant. If a co-ordinate Tvs, defined by T = ict then a 
Lorentz transformation is a linear transformation that leaves the quantity 
x 2 +y 2 + z 2 + T 2 invariant. A proper Lorentz transformation may be 
regarded as a rotation in four dimensional (Minkowski) space-time and 
the ordered 4-tuple (x, y, z, T) is a vector the length of which remains 
unchanged under a Lorentz transformation. Clearly the ordinary rotations 
in (x, y, z) space are a special type of Lorentz transformation. 

Special relativity requires that the laws of nature take the same form 
in all inertia! frames of reference. This can be expressed mathematically by 
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demanding covariance of the appropriate equations under Lorentz 
transformations. This means that all terms in a given equation transform 
in the same way under a transformation and so have the same tensor 
rank. 

Consider the Schrodinger equation for a free particle of mass m. 

_ V 2 * = zfi — . 
2m dt 

The operators V 2 and b/dt are invariant under a linear transformation 
of (jc, y, z) alone and so this equation is covariant under a spatial rotation. 
However it is clearly not covariant under a Lorentz transformation of the 
type (12.4). It may involve second derivatives involving the time after the 
transformation is applied. This result is not surprising as the Schrodinger 
equation is obtained from a non-relativistic, classical Hamiltonian. 



12.3 The Klein-Gordon Equation 

A relativistically invariant quantum theory must be based on a relativistic 
Hamiltonian. 

Consider a free particle of rest mass m moving relative to an observer 
with speed v. The ordered 4-tuple 

iff 

Px,Py,Pz, 

where the first three entries are the momentum components, 
p x = 0m o i p y = &m y p z = pm z 




and H is the Hamiltonian, transforms as a vector in four-dimensional 
space-time and its length is invariant under a Lorentz transformation so 
that 

Pi + Pi +Pz-^T= -mW. (1 2-5) 

The relativistic Hamiltonian is given by 

H 2 = (P 2 + Pi +Pi)c 2 + mlc*. (12.6) 

The quantum mechanical Hamiltonian is obtained by the usual substitution 
for the linear momentum components. 

jf2 = -c 2 ft 2 V 2 +mfc 4 . (12.7) 
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The usual time-dependent Schrodinger equation (postulate 3, Chapter 3) 
is impossible to apply because of the square root in the Hamiltonian. 
However, since the operator «3f is equivalent to the operator 

ot 
then a suitable equation of motion is 

.*** = _*»!?* (128) 

at 



i.e. 

1 3 2 * 



|V 2 * 



C 2 at 2 



^ *• (12-9) 



Note that this equation is second order in time. It is well-known that the 
operator 

V c 2 ot 2 

is invariant under a Lorentz transformation and so equation (12.9) is 
Lorentz covariant as m and c are constant scalar quantities. 

This equation was discovered by Schrodinger (1926) before he 
discovered the non-relativistic equation that bears his name. However 
as it did not give good agreement with the known hydrogen spectrum 
he discarded it. Klein and Gordon (1926) studied equation (12.9) and it 
carries their names. 



Equation of continuity 

In section 5.5 it was shown that in the non-relativistic theory, the 
probability density P and the probability current density S satisfy the 
equation of continuity at each point of space, i.e. 

^ + divS = (12.10) 

where 

p= ■qt*-q/ 



and 



iPi 

S = - [*grad**-**grad*]. (12.11) 

2wq 
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In the same way by multiplying the Klein-Gordon equation (12.9) by 
ty* and subtracting from this result its complex conjugate, the equation 
of continuity (12.10) is again obtained. However in this case the 
probability density is 

p = - — r *-^-**-H. ( 1212 ) 

2im c 2 \ bt btj 

with the probability current density taking the Schrodinger form (12.11). 

The Klein-Gordon probability density (12.12) introduces a difficulty 
that does not arise in the Schrodinger case. The Schrodinger i > (12.1 1) is 
nowhere negative as expected for a probability density. Unfortunately 
this is not the case for the case for the Klein-Gordon probability density 
(12.12). This expression can be positive, zero or negative. This arises 
because the Klein-Gordon equation, unlike the Schrodinger equation, is 
second order in time and the values of ^ and d^jdt at a given instant can 
be arbitrary. Even if these values were chosen to make P > at all points 
of space at some initial time it would not follow that P would not 
become negative at a later time. It was some time before this curious 
result was interpreted. The quantitites eP and eS can be regarded as the 
electric charge density and current density respectively and it is reasonable 
for both to be positive or negative if both signs for charge occur. Also, at 
high energies there is the possibility of creation and annihilation of particle 
pairs so that the number of particles is not conserved. The charge density 
eP depends on the relative number of positive and negative charges and 
so may be positive or negative. It can be shown from the equation of 
continuity that the total charge is conserved, i.e. 

J eP dr= constant. 

The integral is over all space co-ordinates. 

In explaining the change in sign of eP and eS it has been necessary to 
allow the presence of both positive and negative (and neutral) particles 
and this is essentially a many-particle theory. Also, it can be shown that 
in the non-relativistic limit of low energies the Klein-Gordon equation 
reduces to the Schrodinger equation without spin. The solutions of the 
Klein-Gordon equation describe particles with zero-spin. Rons are such 
particles. However the Klein-Gordon equation certainly cannot describe 
particles with spin \ and so cannot describe the motion of electrons. 
Dirac discovered the correct equation for electrons. 



12.4 The Dirac Equation 

By an ingenious method Dirac in 1928 succeeded in obtaining a linear 
relativistic equation which is first order in time. For a free particle, the 
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relativistic Hamiltonian is given by 

^ =(pl+P 2 y +Py)c 2 +mlc\ (12.13) 

where p x , p y , p z represent the usual quantum operators. 

The basic postulate of Dirac was to express the right side of (12.13) 
as a perfect square so that the relativistic quantum Hamiltonian is 

3f = c(a x p x + UyPy + a z p z + (Sm c) (1 2.14) 

with a x , a y , a z , j3 independent of position, momentum and time. The 
negative root could be taken for if but this choice does not lead to any 
extra solutions. From (12.13) and (12.14) it is clearly necessary for 

c 2 (<*xPx +a y P y +* z p z +Pm c) 2 = {pl + p 2 + pj)c 2 +mgc 4 . 
It follows immediately that 

fi=a 2 =a 2 =a 2 = l (12.15a) 

and that any pair of a x , a y , <x z , anticommute, i.e. 

a x a y +a y a x =0 

a x P + 0a x =O, etc. (12.15b) 

It is clear from (12.15) that the a's and (1 cannot be simply numbers 
as they do not commute. They are in fact operators and can be 
represented by square matrices a*, a y , a z and 0. The Hamiltonian is 

ft 
3^ = c-a.giad+^m c 2 (12.16) 



where 

9 9 8 

bx y by bz 



a.grad=o Je — + a y — + a z 



The time-dependent equation (3. 10) becomes 

|cTa.grad+j3m c 2 | * = /ft— . (12.17) 

Equation (12.17), which is first order in both space and time co-ordinates, 
is relativistically co variant and is Dirac's equation for a free particle. It 
must be understood that * is a column matrix representing a many- 
component wave-function. 

As the Hamiltonian must be Hermitian then the a and (3 matrices 
must also be Hermitian. Hermitian matrices that satisfy the commutation 
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relations (12.15) can only be found if they have at least 4 rows and 
columns. One possible choice is to diagonalize jSwith 



\ /I 



and then 




0-1 / \0 -I 






(12.18) 



1 



-1 \ _/0 a z \ 

1 \o z / 



^0 -1 0, 

where a x , a y , a z are the Pauli spin matrices. This choice is certainly not 
unique. Any new set of matrices obtained by premultiplying the above 
by an arbitrary unitary matrix S and post multiplying by S" 1 will also 
satisfy the necessary conditions. The choice (12.18) is called the Dirac 
representation. 

These matrices operate on a four-dimensional 'spinor space' which is 
spanned by a set of four basis functions. The Dirac wave-function 
satisfying (12.17) is represented in this space by a column matrix. 



*(r,0= 02.19) 
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The */(r, f)are the components of the wave-function in the 'spinor- 
space' and the Dirac equation (12.17) is equivalent to four simultaneous 
first order equations that are linear and homogeneous in ^' 1 , * 2 » *3 an d 
* 4 . Each of these component wave-functions separately satisfy the 
Klein-Gordon equation (12.9). 

For stationary states, the time dependence can be separated from the 
spatial part by writing 

*(r, f) = $(r)e- iEt f (12.20) 

where *(r, r) and <£(r) are both four -component column matrices and 
from (12.17) 

jcja.grad + |3woC 2 )«Kr)=£'<K')- 02.21) 



Probability and current density 

The Dirac equation, unlike the Klein-Gordon equation, leads to a 
satisfactory equation of continuity with a non-negative probability 
density. The time-dependent Dirac equation for a free particle is 

z% — + Ktfa.grad*-/3m o c 2 *=0. (12.17) 

ot 



This is a matrix equation and its Hermitian conjugate is (take the 
complex conjugate and transpose and note (oj )* = a y , etc.). 

9*+ 

-ft—- ich grad * + . o - m c 2 * + /3 = 0. (1 2.22) 

at 

* + is the row matrix Hermitian conjugate to *, i.e. 

* + (r, t) = (*f (r, 0, *J(r, t), *J(r, r), *&, t)). 

Observe that in obtaining (12.22), the order of matrices in the second 
and third terms of (12.17) has been reversed. 

The equation of continuity is obtained in the usual way. Premultiply 
(12.17) by * + and postmultiply (12.22) by * and subtract to obtain 



9* 94> + \ 
* + — + — — * + icti (**a. grad * + grad * + .ct*) = 
at at J 



i.e. 



m 



— (* + *) + c div (# + a4$ = 0. (1 2.23) 
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This is the equation of continuity with probability density 

p=**»=|¥il 2 +l*2l 2 + l*3l 2 + l*4l 2 (1224) 

and probability current density 

S = c* + a*. (12.25) 

Equation (12.24) ensures that the probability density is nowhere 
negative and is the expected result. 

The expression representing the probability current density is very 
interesting as the eigenvalues of a are ± 1 . The operator representing the 
velocity components of the particle is 

i= l -[^,t]=ctt (12.26) 

n 

and so the eigenvalues off are ±c. An exact measurement of a velocity 
component must give ±c. This represents a 'shuddering' of the particle 
with speed c. Trigg (Quantum Mechanics), examines this result in more 
detail and suggests that it is meaningless to specify the position of the 
particle to an accuracy of less than its Compton wave-length h/m c. In 
this case the uncertainty principle would require that the uncertainty 
in the energy should be of the order of 2 m c 2 and this would be 
sufficient to produce a pair of particles (see last section). 

The equation of continuity (12.23), the probability density (12.24) 
and the probability current density (12.25) remain valid in the presence 
of an electromagnetic field. 

Covariance of the Dirac equation 

The Dirac equation can be shown to be covariant under a Lorentz 
transformation. The reader is referred to Group Theory and Quantum 
Mechanics, by Heine, and Quantum Mechanics, by Davydov, for a 
discussion. 

Plane wave solutions 

It is expected that the free-particle Dirac equation (12.17) will have 
plane wave solutions of the type 

^ j (t,t) = a j ex V i(k.i-Et/h) /= 1,2,3,4 (12.27) 

with 

k.T = xk x +yk y +zk z (12.28) 



THE DIRAC EQUATION 319 

and a f are constants. Substitution of (12.27) into (12.1 7) gives the four 
simultaneous equations 

(m c 2 - £>i + chk z a 3 + ch(k x - ik y )a 4 = 
(m c 2 - E)a 2 + ch(k x + ik y )a 3 - chk z a 4 = 
cKk z a x + ch(k x - ik y )a 2 - (m Q c 2 +E)a 3 = 
ch (k x + ik y )a x - chk z a 2 - (m c 2 + E)a A = 0. (1 2.29) 

For a non-trivial solution, the determinant of the coefficients of the 
a,- must vanish, i.e. 

(E 2 -mlc*-c 2 h 2 k 2 ) 2 =0 (12.30) 

where 

k 2 =k 2 x +k 2 y +k 2 z . (12.31) 

The energy can take the two values (both double roots) 

E + = +(m 2 c 4 +c 2 fi 2 k 2 ) 1/2 

E_=-(mlc 4 +c 2 h 2 k 2 ) in (12.32) 

There are no allowed energy states between ±moc 2 . 

There are two linearly independent solutions corresponding to the 
positive energy E+. In a non-normalized form, they cap be chosen to be 

chk z _ ctt(k x +ik y ) 

fll=1 a2=0 a3= m c 2 +E + " 4 = m c 2 + E + 

(12.33) 

and 

ch(k x - ik y ) _ -chk z 

fll=0 * 2 = 1 * 3= m c 2 + E + fl4= m c 2 + £/ 

(12.34) 

Similarly, for the negative energy #_ , there are two linearly 
independent solutions 

-chk z -ch(k x +ik y ) 

a l = 2 £~ fl 2 = 2 7T~ «3-l «4-0 

m c 2 - £_ m c - E- 



-ch(k x - iky) chk z 

m c 2 -E_ a2 ~ m c 2 - E_ 



ai ^T" ^"^c 2 -^ " 3 ~° * 4 L 

(12.35) 
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In the non-relativistic limit E+ ~ m c 2 and the positive energy solutions 
(12.33) have two 'large' components a x , a 2 and two 'small' components 
of order u/2c. The negative energy solutions then have two 'small' 
components a lt a 2 and two 'large' components a 3 , a 4 . 
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Figure 12.2 Continuous energy spectrum. 

The energy gap between ±m c 2 occurs in 'classical' relativistic theory 
(12.6). However these negative energies can be ignored in classical theory 
as energies can only change continuously and a particle whose energy is 
positive at some instant will remain so. This is not the case in quantum 
theory as the particle energy can change discontinuously. The relation 
between positive and negative energies will be discussed later. 



Introduction of spin 

For a free particle it is expected that angular momentum will be con- 
served. However it is simple to show that orbital angular momentum is 
not a constant of the motion. 

In the matrix representation the operator representing the z- 
component of orbital angular momentum is 

'••rte-'sr)' (1 " 6) 
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where I is the 4 x 4 unit matrix. The free particle Dirac Hamiltonian is 



&? — j c — o . grad +/3m c 2 



(12.37) 



The second term in the Hamiltonian is diagonal and so commutes with 
S£ z . It is simple to show that the commutator 



W,&A = 



c 7 «.grad, 7 ^--^-|I 






= -ft 2 c(«Agrad) z . 



(12.38) 



As SP Z an d ^do not commute then the z -component of orbital 
angular momentum is not conserved. The generalization of (12.38) to 
include the two other components gives 



[5£.2?]=-ft 2 c(aAgrad). 
Now consider the matrix operator 

'l 0' 

0-10 \ la z 
o T = | 

10 

k 0-1 
It is not difficult to show that the commutator 



(12.39) 



(12.40) 



2 



<?r. „ o z 



= +ft 2 c(oAgrad) z . 



By defining 



H:-:J H:l) 



it can similarly be shown that 

= +ft 2 c(aAgrad) 



*|rf 



(12.41) 



(12.42) 



(12.43) 



where a has the three components a' x , a' y and o z . 
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By combining (12.38) and (12.43) then 



and 



^S'+yo' 



2 



:0 (12.44) 



is a constant of the motion. 

By analogy with the Pauli spin theory a spin operator S' is defined by 

S' = |o' (12.45) 

and the total angular momentum J is defined by 

J=.S?+S'. (12.46) 

The components of both S' and .2? satisfy the necessary commutation 
relations for angular momentum operators (10.4). From (12.44), total 
angular momentum is conserved for a free particle. 

The operators a' x , a' y ,o' z have eigenvalues ± 1 corresponding to 
eigenvalues ± fi/2 for the spin operators Si, S^,, S' z . The Dirac equations 
describes particles whose spin along any specified direction can only take 
the two values ± ti/2. An electron is such a particle with spin \ . (Other 
such particles are protons, neutrons, etc.) 

Consider the plane wave energy functions obtained in the previous 
section. In the non-relativistic limit, putting the small components equal 
to zero, the column vectors representing the two solutions with energy 
E+ are, from (12.33) 



and 



These are simultaneously eigenvectors of S^ with eigenvalues +fi/2 and 
— h/2 respectively. Of course these vectors are not eigenvectors of S* or 
S y as these operators do not commute with S^. 

Similarly, the 'non-relativistic' eigenvectors with energy E_ are, from 
(12.35) 



and 
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and are simultaneously eigenvectors of S* with eigenvalues + h/2 and 
— fi/2 respectively. 

12.5 The Dirac Particle in an EM. Field 

An electromagnetic field is described completely when the vector 
potential A and the scalar potential are given. It can be shown 
(Classical Electricity and Magnetism, by Panofsky and Phillips), that the 
ordered 4-tuple 



A A &-) 

c J 



is a vector in Minkowski space and its length is invariant under a Lorentz 
transformation. 

The momentum p x conjugate to x for a particle (electron) with charge 
-e in an electromagnetic field is given by ((1 .43) holds in relativistic 
mechanics) 

Px =pm x-eA x (12.47) 

and similarly for the other two components. The 'classical' relativistic 
Hamiltonian for the particle in the field is obtained from the free particle 
Hamiltonian in the usual way by substituting for the linear momentum 
in terms of the new conjugate momentum (12.47) and also replacing the 
Hamiltonian by H + etj>. From (12.6) 

(H + e<t>) 2 = c 2 [( Px + eA x f + (p y + eA y ) 2 + ( Pz + eA z ) 2 ] + mgc 4 . 

(12.48) 

A quantum-mechanical wave equation is obtained by the substitutions 



and 



d 
H-*m — 

dt 






h a 


[ih — + e<p\ * = c 2 


[rh +eA *) 2 + [jh +eA >f 


\jh +eA ) 2 


+ mlc 2 


*. 



(12.49) 

This relativistically covariant equation is the Klein-Gordon equation for 
a particle of charge — e in an electromagnetic field and will not be 
considered further. 
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The Dirac equation for a particle of charge -e in the field is obtained 
in a similar manner. From (12.17) 



ihl — + le<t>-ca. It grad +eA) -j3w c 2 



* = 0. 



(12.50) 



This equation is also relativistically covariant but its solutions are not 
solutions of the Klein-Gordon equation unlike the field-free case. To see 
this operate on (12.50) using 

., , 3 „ I ft \ « 

J«I — + Ie^+ca.l — grad +eA \+pm c* 

to obtain a similar but different equation to (1 2.49). 
To find the stationary states of the Dirac equation put 

*(r,f) !E <K')e~ arr/ * 
in (12.50) and then 



<f>(r) = 0. (12.51) 



IE + 1 e<p - ca, . ( — grad + e A | - j3m c 



It has already been demonstrated that in the non-relativistic limit the 
four-component Dirac wave-function can be divided into large and small 
components. With this limiting case in mind put 



<K') : 



where Xi and Xs are two-component column vectors. With this substituion 
equation (12.5 1) becomes 



\[E + e<t> - m c 2 ] 
\[E + e<j> + m c 2 



with 



X; - ca. It grad +e\ W*=0 
]%s ~ co . I - grad + eA I X; = 

r 



(12.52) 



+ oA~ — + eA 2 
I az 

Consider a positive energy solution with 

E = m c 2 +£"'. 
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Eliminating \s fr° m (12.52) 



o.j Tgrad +eA I I 

\ l , ; —-le<t> [Xi = lE'xi. 

2m c 2 +E' + e(t> J A 
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(12.53) 



In the non-relativistic limit of small kinetic energy and if the field is 
weak then E' < m c 2 and e0 < m c 2 and so 



1 



1 



(2m Q c 2 +E' + e<j>) 2m c 2 



1 



E' + e<(> 
2m c 2 



Keeping the first term only of this binomial expansion (12.53) becomes 



2m 



o . J — grad + eA 



-le<l>\xt = lE'x,. 



(12.54) 



This is a two-component equation and gives the large components Xi- 

In this non-relativistic limit the magnitude of the momentum is given 
by m V where Vis the electron speed and the second of the equations 
(12.52) becomes 

Xs -. „ Xz • 

2 c 

This result confirms that Xs is 'small' compared with Xi- As the operator 
grad +e\ 



I -7 grad +e\ \ 



commutes with the components of o then it can be shown that 
(problem 6) 



'■0 



a . I — grad + e A 



grad+eA| +io. 



A | — grad + e A 



j =ll-7grad+eAj 
.1 =II-7grad+eAj 



I Tgrad +eA j 



grad+eA +effo.curlA. (12.55) 



The magnetic flux 

B = curl A 

and so in the non-relativistic limit, the two large components of the 
Dirac equation satisfy 



2^i7 grad+eA ) 



grad+eA) — le<j> 



2m n 



o.B 



X, =\E'x,. (12.56) 
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This equation was suggested by Pauli for the electron. It is of course 
simply the equation obtained by combining the Schrodinger non- 
relativistic theory and the Pauli electron spin theory of Chapter 10. 
(c.f. (8.24), (10.24), (10.25)). The Pauli theory is obtained in the non- 
relativistic limit (i.e. small velocity) from the Dirac theory. The final term 
on the left hand side of (1 2.56) implies that the particle has an intrinsic 
magnetic moment of 

-eh 
a 



2 Wo 



in agreement with the value assumed in Chapter 10 for the electron. 
Clearly the particles described by the Dirac equation are electrons. 

Coulomb field 

A Coulomb field due to a charge of +Ze can be described by 

A = = -pk 
4ne r 

The Dirac Hamiltonian is then 

h — Ze 2 

3tf = c — ct.grad +&m c 2 . 

i 47re r 

The solutions of the corresponding equation is dealt with in detail by 
Davydov (Quantum Mechanics, p. 266). It is sufficient here to point out 
that the total angular momentum as defined by (12.46) is conserved for 
a particle in a central field. The reader is asked to verify this himself. 



12.6 The Positron 

It has been shown that the Dirac equation has both positive and negative 
energy states and this leads to a difficulty in interpretation. These nega- 
tive energy states correspond to the negative square root of the classical 
'relativistic' expression (12.6). However, in classical theory a particle 
energy changes continuously and so a particle in a state with positive 
energy will remain so and the negative energy states can be ignored. This 
is not the case in quantum mechanics as a particle can make discontinuous 
jumps in energy and so cross the 'forbidden' gap 2m c 2 . There are an 
infinite number of negative energy states. In this case it would seem 
reasonable to expect a Dirac particle to fall to lower energy states 
radiating continuously. However electrons are not observed to do this 
and some other interpretation of the negative energy states must be 
forthcoming. 
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In 1930 Dirac proposed that in the normal state the negative energy 
states are filled and assuming the particles obey the Pauli principle a 
particle with positive energy cannot jump to a state with negative 
energy. The charge density due to the filled negative energy states must 
be assumed to be unobservable. A vacuum is then a space in which all 
negative energy states are filled and all positive energy states are empty. 
Only deviations from the vacuum state are observed. 

A positive energy solution of the Dirac equation (12.50) corresponds 
to an ordinary negatively charged particle (electron). A negative energy 
state is only observed when it is empty. Now a filled negative energy 
state corresponds to a particle with both negative charge and negative 
mass. Consequently the observed empty negative energy state appears as 
a particle with equal positive charge and positive mass. This 'anti-particle' 
appears as a hole in the 'sea' of filled negative energy states. The energy 
of the anti-particle is positive as it corresponds to the energy required to 
lift a negatively charged particle out of the negative energy state. 

It is quite easy to relate the behaviour of an electron to its anti- 
particle. The Dirac equation for an electron of charge — e and rest mass 
m in an electromagnetic field is 



ih\ — + 1 e<t> - co . I — grad + e A I — &m c 2 
bt \i ) 



*=0. (12.57) 



It is useful to choose a representation in which the o matrices are all 
real and |3is imaginary. A choice satisfying the conditions (12.15) is to 
take a x and a z as before (12.18) and interchange a y and 0to obtain 





I 








«y = \ 


1 





-1 








\o 





-1 




With this choice take the complex conjugate of equation (12.57). 

** = 0. 



ihl — + l(~e)(t> - co . I t grad + (-e)A J -/3w c 2 



(12.59) 



If in (12.57)* describes an electron with charge — e, then** 
describes a particle with equal mass but opposite charge +e. 



328 A FIRST COURSE IN QUANTUM MECHANICS 

Consider now the stationary states. The Dirac equation for an 
electron with energy E is 



IE + le<t>-ca.\r grad + e A ) -/3m c 2 



*=0. (12.60) 



Take the complex conjugate of this equation. 

I(-E) + l(-e)0 - cot . I - grad + (-e) A J -fim Q c 2 ** = 0. 

(12.61) 

If in (12.60) * describes an electron with negative energy (E < 0) 
then4>* describes a positively charged particle with positive energy. 
The electron and its antiparticle appear symmetrically in the theory. The 
pair of equations (12.60) and (12.61) should be considered together and 
the negative energy solutions of one correspond to the positive energy 
solutions of the other. 

From this discussion it is seen that the electron and its antiparticle 
possess the same mass, spin and differ only in the sign of their charge. 
The electron antiparticle was discovered in 1932 and is called the 
positron. 

It is observed that a positron always appears as a pair together with 
an electron and that an energy greater than 2m c 2 is absorbed in the 
pair creation. This is explained as in pair creation an electron must be 
excited from a negative energy state through the forbidden gap of 
2ntoC 2 to a positive energy state. The empty 'hole' manifests itself as 
the positron and the electron in the positive energy state behaves as an 
ordinary electron. 

The reverse process takes place when an electron falls into an 
unoccupied negative energy state. This occurs as electron-positron 
annihilation and the energy is emitted as photons. 

To explain the negative energy state it is necessary to extend the 
single particle Dirac theory to a type of many-particle theory. This is not 
very satisfactory. The difficulties can be overcome using the method of 
quantized fields but this development is outside the scope of this text. 
The second quantization formalism mentioned in the previous chapter is 
necessary to cope with the annihilation and creation of particles. 



PROBLEMS 

1 Equation (12.9) is the Klein-Gordon equation for a free particle. 
Consider the transformation 

*(r, t) = *„(r, t) exp (-im c 2 t/h). 
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In the non-relativistic limit E = E' + m Q c 2 with£" < m c 2 . Also in 
general 

3*o 






dt 
Hence show that 

— a ° * exp (-im c 2 tfh) 
ot fi 

and obtain a similar expression for 9 2 */9r 2 . 

Show further that in this approximation of E'< m c 2 the Klein- 
Gordon equation becomes 

2m 9? 

This is Schrodinger's non-relativistic equation for a free particle. 

2 Using the same transformation as in the above question and in the 
same non-relativistic limit, show that the Klein-Gordon probability 
density (12.12) reduces to the Schrodinger probability density (12.1 1). 

3 Show that conditions (12.15) are necessary for the square of (12.14) 
to be equal to (12.13). 

4 Show that the Dirac matrices a*, a y , ct z and Pare both Hermitian 
and unitary. Show that all four matrices have eigenvalues of ±1 only. 

5 Confirm that the total angular momentum operator (1 2.46) commutes 
with the Hamiltonian for a Dirac particle in a central field. 

6 If A and B are two vector matrix operators that commute with the 
Pauli spin matrices show that 

(o . A)(o . B) = (A . B) + /[o. (A A B)] 

where a. A = o^A* + a y \ y +c 2 A 2 , etc. Hence prove equation (12.55). 
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